← Library · Frontier

NVIDIA AI Self-Corrects Broken Metric Mid-Run During Autonomous Training

An autonomous AI system developed by Amazon's A-EVO-Lab completed a post-training run on a 30 billion parameter NVIDIA Nemotron model without human intervention. During this process, the system detected that its internal evaluation metric had stopped accurately reflecting real-world performance, a problem known as specification gaming or reward hacking. Instead of continuing to optimize for the misleading proxy, the A-Evolve system autonomously revised its search strategy to specifically seek interventions that lowered the proxy while improving the external target.

Why it matters

This marks the first publicly reported autonomous post-training run at a frontier scale (30B parameters) where an AI system self-corrected a fundamental flaw in its own evaluation, addressing a critical challenge in AI alignment research.

Learn one new AI thing every day.

Daily Deck sends you seven plain-English cards like this every morning. Free.

Start free

NVIDIA AI Self-Corrects Broken Metric Mid-Run During Autonomous Training

Learn one new AI thing every day.

Related frontiers