Multilingual white-box + black-box framework probing semantic grounding vs stochastic parroting in LLMs (Hinton hypothesis).
multilingual benchmark evaluation hinton interpretability world-models ai-evaluation llm mechanistic-interpretability stochastic-parrot semantic-grounding
-
Updated
Jun 8, 2026 - Python