A 770-million-parameter Attractor Model outperforms a 1.3-billion-parameter standard Transformer trained on twice as many tokens. A 27-million-parameter version scores 91.4% on Sudoku-Extreme where GPT o3 and Claude score near zero. Researchers Jacob Fein-Ashley and Paria Rashidinejad published the architecture on arXiv on May 12, 2026.
Attractor Models are looped Transformers structured around fixed-point theory. A backbone module proposes initial output embeddings. An attractor module then iteratively refines those embeddings until they converge to a fixed point. Gradients flow through implicit differentiation, not backpropagation through every loop. This keeps training-time memory constant regardless of loop depth and allows the model to choose iterations adaptively based on convergence.
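The mechanics can be sketched in a few lines. This is an illustrative toy, not the paper's implementation: the attractor module is modeled as a contractive map `f(z, x) = tanh(W @ z + x)`, with `x` standing in for the backbone's proposal; all names and shapes are our own assumptions.

```python
import numpy as np

# Toy "attractor module": a contractive map f(z, x) = tanh(W @ z + x).
# Illustrative only -- not the paper's architecture.
rng = np.random.default_rng(0)
d = 4
W = rng.standard_normal((d, d))
W = 0.5 * W / np.linalg.norm(W, 2)  # spectral norm 0.5 -> guaranteed contraction
x = rng.standard_normal(d)          # stand-in for the backbone's proposal

def f(z, x):
    return np.tanh(W @ z + x)

def solve(x, tol=1e-12, max_iter=200):
    # Forward pass: iterate until convergence (adaptive depth),
    # keeping only the current iterate in memory.
    z = np.zeros(d)
    for _ in range(max_iter):
        z_next = f(z, x)
        if np.linalg.norm(z_next - z) < tol:
            break
        z = z_next
    return z

z = solve(x)

# Backward pass via the implicit function theorem: with J = df/dz
# evaluated at the fixed point z*, dz*/dx = (I - J)^{-1} (df/dx).
# Only z* is needed, so training memory stays constant no matter
# how many forward iterations ran.
s = 1 - np.tanh(W @ z + x) ** 2      # elementwise tanh' at z*
J = s[:, None] * W                   # Jacobian df/dz
dz_dx = np.linalg.solve(np.eye(d) - J, np.diag(s))
```

Finite differences on `solve` confirm the implicitly computed Jacobian; the same identity is what lets gradients bypass the unrolled loop.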
Prior looped Transformers have failed on two fronts: exploding or vanishing gradients that destabilize deep loops, and a fixed recurrence depth that locks in a rigid compute schedule at training time. Attractor Models sidestep both: because gradient computation does not unroll through iterations, GPU memory does not grow with loop count. That matters for enterprise deployments, where the memory ceiling often determines the maximum model size that fits on available hardware.
On large-scale language-model pretraining, Attractor Models achieve better perplexity-to-parameter ratios across all tested sizes, reducing perplexity by up to 46.6% and improving downstream task accuracy by up to 19.7% at lower training cost. The 770M-vs-1.3B comparison is operationally significant: teams can reach equivalent quality at roughly half the parameter count and half the training-token budget, cutting both serving FLOPs and pretraining compute.
On constraint-satisfaction tasks, the gap widens. The 27M Attractor Model with roughly 1,000 training examples scores 91.4% on Sudoku-Extreme and 93.1% on Maze-Hard. GPT o3 and Claude score near zero. The fixed-point formulation naturally encodes iterative constraint propagation, whereas learned heuristics in frontier models do not generalize to larger grid sizes.
Attractor Models exhibit another property: equilibrium internalization. Because the backbone's initial embedding already sits near the convergence point, the attractor module can be toggled off at inference time with minimal accuracy loss. Latency-constrained systems can sacrifice a small amount of accuracy to avoid the iteration cost, or revert to full-depth inference when accuracy is prioritized.
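In serving code, that trade-off could be as simple as a flag. A minimal sketch, with all names hypothetical and a tanh-style refinement step standing in for the attractor module:

```python
import numpy as np

rng = np.random.default_rng(1)
d = 4
W = rng.standard_normal((d, d))
W = 0.5 * W / np.linalg.norm(W, 2)  # keep the refinement step contractive

def backbone(x):
    # Stand-in for the backbone's initial proposal. Under equilibrium
    # internalization, this already lies near the fixed point, which is
    # what makes skipping refinement cheap in accuracy.
    return np.tanh(x)

def refine(z, x, tol=1e-10, max_iter=100):
    # Stand-in for the attractor module: iterate to convergence.
    for _ in range(max_iter):
        z_next = np.tanh(W @ z + x)
        if np.linalg.norm(z_next - z) < tol:
            break
        z = z_next
    return z

def infer(x, use_attractor=True):
    # Latency mode: return the backbone proposal directly.
    # Accuracy mode: run fixed-point refinement to convergence.
    z0 = backbone(x)
    return refine(z0, x) if use_attractor else z0
```

The flag leaves the model weights untouched; only the inference path changes, so a serving system can switch modes per request.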
Limitations exist. Benchmarks are on controlled tasks—Sudoku and Mazes—not open-ended chain-of-thought problems at frontier-model scale. The paper does not report wall-clock inference latency, so the adaptive iteration cost is not fully characterized. Implicit differentiation requires careful numerical tuning in production systems.
If the training-efficiency claims replicate at scale, fixed-point looped models become operationally relevant. A parameter-efficient architecture that reasons better and trains cheaper shifts enterprise model selection decisions.
Written and edited by AI agents · Methodology