As Language Models Scale, Low-order Linear Depth Dynamics Emerge
arXiv cs.LG · March 16, 2026
Key Points
- A 32-dimensional linear surrogate accurately reproduces the layerwise sensitivity profile of GPT-2-large across classification tasks including toxicity, irony, hate speech, and sentiment.
- The surrogate reveals how the final output shifts when small additive injections are made at each layer, enabling precise, interpretable analysis of depth dynamics.
- The authors uncover a scaling principle: for a fixed-order surrogate, agreement with the full model improves monotonically as model size increases across the GPT-2 family.
- The linear surrogate enables principled multi-layer interventions that achieve target output shifts with lower injection energy than standard heuristic schedules when transferred back to the full model.
- Together, the results suggest that as language models scale, low-order linear depth dynamics emerge, providing a systems-theoretic basis for analysis and control.
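The depth-dynamics view above can be made concrete with a small sketch. This is an illustration of the general idea, not the paper's fitted surrogate: the transition matrix `A`, readout `c`, and dimensions below are assumptions. Depth is treated as a discrete-time linear system `x_{l+1} = A x_l + u_l` with scalar readout `y = c @ x_L`, so a small additive injection `u_l` at layer `l` shifts the output by `c @ A^(L-l) @ u_l`; the norms of these rows form a layerwise sensitivity profile, and the minimum-norm least-squares solution spreads an intervention across layers with less energy than injecting at any single layer.

```python
import numpy as np

rng = np.random.default_rng(0)
d, L = 32, 12                      # assumed surrogate order and layer count
A = 0.9 * np.linalg.qr(rng.normal(size=(d, d)))[0]   # contractive transition
c = rng.normal(size=d)             # linear readout

# Per-layer sensitivity rows g_l = c @ A^(L-l); their norms give the
# layerwise sensitivity profile a surrogate of this form reproduces.
rows = []
M = np.eye(d)
for _ in range(L):                 # builds rows for l = L, L-1, ..., 1
    rows.append(c @ M)
    M = A @ M
G = np.stack(rows[::-1])           # G[l-1] = c @ A^(L-l), shape (L, d)
profile = np.linalg.norm(G, axis=1)

# Minimum-energy multi-layer intervention achieving a target output shift
# delta: minimum-norm solution of sum_l g_l @ u_l = delta.
delta = 1.0
g = G.reshape(-1)                  # stack all layer rows into one long row
u = g * delta / (g @ g)            # u = g^T (g g^T)^{-1} delta
energy_multi = u @ u               # = delta^2 / ||g||^2

# Heuristic baseline: inject only at the single most sensitive layer.
best = int(np.argmax(profile))
energy_single = delta**2 / profile[best] ** 2
```

Because `||g||^2` sums the squared sensitivities of all layers, `energy_multi` is never larger than `energy_single`, which is the systems-theoretic intuition behind the energy savings reported for multi-layer schedules.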