The Illusion of Superposition? A Principled Analysis of Latent Thinking in Language Models
arXiv cs.CL / 4/9/2026
💬 Opinion · Ideas & Deep Analysis · Models & Research
Key Points
- The paper empirically tests whether language models genuinely exploit “superposition” when reasoning with continuous latent chain-of-thought (latent CoT), rather than taking theoretical arguments for it at face value.
- It evaluates three setups: training-free convex combinations of token embeddings, fine-tuning models to generate latent thoughts, and training from scratch. Only the from-scratch models show behavior consistent with actually using superposition (a minimal sketch of the first setup follows this list).
- In the training-free and fine-tuned regimes, the purported superposition typically collapses or goes unused, with models converging on shortcut solutions instead.
- The authors attribute these outcomes to (1) natural-language pretraining biases that push later layers to commit to a single token, and (2) model-capacity effects that shape which candidate solutions are favored.
- Overall, the work offers a principled account of when superposition emerges in continuous CoT and the conditions under which it collapses.
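
To make the first setup concrete, here is a minimal sketch of what a training-free convex-combination latent CoT step can look like: instead of sampling one token, the next-token distribution is used to mix the top-k candidate embeddings into a single continuous “thought” vector that is fed back as the next input. This is an illustrative reconstruction under stated assumptions, not the paper’s exact protocol; the GPT-2 backbone, `top_k=8`, and four latent steps are all arbitrary choices.

```python
# Illustrative sketch only (assumptions: GPT-2 backbone, top_k=8, 4 latent steps).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()
emb = model.get_input_embeddings().weight  # (vocab_size, d_model)

@torch.no_grad()
def latent_step(inputs_embeds: torch.Tensor, top_k: int = 8) -> torch.Tensor:
    """Append one latent thought: a probability-weighted convex combination
    of the top-k candidate token embeddings, rather than one sampled token."""
    logits = model(inputs_embeds=inputs_embeds).logits[:, -1, :]
    probs, idx = logits.softmax(dim=-1).topk(top_k, dim=-1)
    probs = probs / probs.sum(dim=-1, keepdim=True)        # weights sum to 1
    latent = (probs.unsqueeze(-1) * emb[idx]).sum(dim=-2)  # (batch, d_model)
    return torch.cat([inputs_embeds, latent.unsqueeze(1)], dim=1)

ids = tok("2 + 3 * 4 =", return_tensors="pt").input_ids
x = emb[ids]               # start from the prompt's token embeddings
for _ in range(4):         # a few latent reasoning steps
    x = latent_step(x)
```

On a model pretrained on natural language, the paper’s finding predicts that the top-k weights in such a loop quickly become near one-hot, i.e. the superposition collapses to a single committed token rather than being exploited.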