On the Expressive Power and Limitations of Multi-Layer SSMs
arXiv cs.LG / 4/17/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper studies how multi-layer state-space models (SSMs) differ from streaming models in compositional tasks, identifying a fundamental expressiveness gap.
- It analyzes the effect of chain-of-thought (CoT), finding that offline CoT does not fundamentally increase expressiveness, while online CoT can substantially boost it.
- With online CoT, multi-layer SSMs are shown to be equivalent in power to streaming algorithms for the considered setting.
- The authors explore the width–precision tradeoff, concluding that width and precision are not interchangeable in the base model but become cleanly equivalent when online CoT is permitted.
- The work provides a unified framework describing how depth, finite precision, and CoT collectively determine what SSMs can and cannot do.


![[2026] OpenTelemetry for LLM Observability — Self-Hosted Setup](/_next/image?url=https%3A%2F%2Fmedia2.dev.to%2Fdynamic%2Fimage%2Fwidth%3D1200%2Cheight%3D627%2Cfit%3Dcover%2Cgravity%3Dauto%2Cformat%3Dauto%2Fhttps%253A%252F%252Fdev-to-uploads.s3.amazonaws.com%252Fuploads%252Farticles%252Flu4b6ttuhur71z5gemm0.png&w=3840&q=75)
