# The Right Answer, the Wrong Direction: Why Transformers Fail at Counting and How to Fix It

[NeurIPS 2026 Submission] | arXiv (full version)
Transformers fail at counting not because they can't represent counts, but because the output pathway can't route the answer: linear probes recover counts with R² > 0.99 from layer 2 onward, yet generation accuracy stays low until the output pathway is repaired (see the results table below).
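The probe analysis is straightforward to approximate. Below is a minimal sketch assuming a HuggingFace causal LM and a toy counting task; the model name (facebook/opt-125m), probe layer, and task format are illustrative placeholders, and the paper's actual setup is in supplement/code.

```python
import torch
from sklearn.linear_model import Ridge
from sklearn.metrics import r2_score
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "facebook/opt-125m"  # placeholder base model, not the paper's
LAYER = 2                    # probe layer; the paper reports R^2 > 0.99 from layer 2+

tok = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL, output_hidden_states=True)
model.eval()

def last_token_state(prompt: str) -> torch.Tensor:
    """Hidden state of the final token at the probed layer."""
    with torch.no_grad():
        out = model(**tok(prompt, return_tensors="pt"))
    return out.hidden_states[LAYER][0, -1]

# Toy counting examples: (prompt, ground-truth count).
examples = [("a " * n + "How many a's?", n) for n in range(1, 50)]
X = torch.stack([last_token_state(p) for p, _ in examples]).numpy()
y = [n for _, n in examples]

probe = Ridge(alpha=1.0).fit(X[:40], y[:40])                    # train split
print("held-out R^2:", r2_score(y[40:], probe.predict(X[40:])))
```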
## Paper

- arXiv (full, 26 pages): arxiv.org/abs/2605.03258 | paper/main.tex
- NeurIPS 2026 (9 pages, anonymized): paper/main_neurips.pdf | paper/main_neurips.tex
## Reproducing the results

All experiments can be reproduced from the supplement. See supplement/README.md for the full guide.
```bash
cd supplement/code
python data_generation.py                   # Generate benchmark
python run_phase112_fullvocab_all_tasks.py  # 9-row repair
python run_phase118_lora_generation.py      # LoRA Q/V generation
python run_phase122_cot.py                  # CoT baseline
```

Results match the paper exactly, verified on the same TPU VM (PyTorch, CPU mode).
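For reference, a few-shot chain-of-thought prompt of the kind run_phase122_cot.py evaluates might look like the sketch below; the exact template, tasks, and shot count are defined in the script, so this is purely illustrative.

```python
# Illustrative few-shot CoT prompt for a counting task; the actual
# template used by run_phase122_cot.py may differ.
COT_PROMPT = """\
Q: How many times does 'b' appear in 'a b b a b'?
A: Scanning token by token: a (0), b (1), b (2), a (2), b (3). The answer is 3.

Q: How many times does 'x' appear in '{sequence}'?
A:"""

print(COT_PROMPT.format(sequence="x y x x y"))
```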
## Key results

| Experiment | Setting | Result |
|---|---|---|
| Probe R² | Layer 2+ | >0.99 |
| 9-row repair | Constrained | 60.7–100.0% |
| LoRA Q/V generation | 5 seeds | 83.1% ± 7.2% |
| CoT baseline | Few-shot | 20.2% ± 1.9% |
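The LoRA Q/V experiment adapts only the attention query and value projections. A minimal sketch with the peft library is below; the base model (facebook/opt-125m here), rank, and alpha are placeholders, and the paper's actual settings live in supplement/code/run_phase118_lora_generation.py.

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
config = LoraConfig(
    r=8,                                  # low-rank update dimension (placeholder)
    lora_alpha=16,                        # scaling factor (placeholder)
    target_modules=["q_proj", "v_proj"],  # adapt only the Q and V projections
    lora_dropout=0.0,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()        # only the Q/V adapters are trainable
```

Restricting the adapters to Q and V leaves the base weights, including the output head, frozen, which is what makes the experiment a targeted test of the output-routing hypothesis rather than general fine-tuning.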
## Repository structure

```
.
├── paper/           # LaTeX source, PDFs, figures, checklist
├── supplement/
│   ├── code/        # 16 Python experiment scripts
│   ├── results/     # 14 primary result JSONs
│   └── figures/     # All paper figures (PDF + PNG)
└── README.md
```
## Citation

```bibtex
@inproceedings{garcia2026right,
  title={The Right Answer, the Wrong Direction: Why Transformers
         Fail at Counting and How to Fix It},
  author={Garcia, Gabriel},
  booktitle={Advances in Neural Information Processing Systems},
  year={2026}
}
```

## License

Code: MIT. Paper, figures, and text: CC BY 4.0.