AI & ML New Capability

Polaris introduces a 'Gödel Agent' framework that allows 7B-parameter models to recursively improve their own policies through auditable code patches.

March 25, 2026

Original Paper

Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair

Aditya Kakade, Vivek Srivastava, Shirish Karande

arXiv · 2603.23129

The Takeaway

Unlike standard self-correction which happens at the response level, this framework enables persistent, cumulative policy refinement. It demonstrates that smaller, compact models can perform autonomous strategy abstraction and self-repair previously reserved for much larger systems.

From the abstract

Gödel agent realize recursive self-improvement: an agent inspects its own policy and traces and then modifies that policy in a tested loop. We introduce Polaris, a Gödel agent for compact models that performs policy repair via experience abstraction, turning failures into policy updates through a structured cycle of analysis, strategy formation, abstraction, and minimal code pat ch repair with conservative checks. Unlike response level self correction or parameter tuning, Polaris makes policy le

Read the original paper →

← Back to today's papers