Autonomous SKILL.md improvement through iterative, eval-driven hill-climbing inspired by Karpathy's autoresearch, with force iteration, NEUTRAL mutation judgment, mutation exclusions, L1/L2/L3 eval levels, and external verification integration