Semantic Sections: An Atlas-Native Feature Ontology for Obstructed Representation Spaces
arXiv cs.LG / 2026/3/24
Key Points
- The paper argues that treating interpretability “features” as single global directions or shared latent coordinates can fail when the representation space is obstructed, that is, when locally coherent meanings do not assemble into a single global one.
- It introduces “semantic sections,” an atlas-native ontology: a transport-compatible family of local feature representatives defined over a context atlas, with formal properties that distinguish globalizable meanings from holonomy-obstructed (“twisted”) ones.
- The authors prove that tree-supported propagation is pathwise realizable and identify cycle consistency as the key criterion for true globalization of local semantics.
- They propose a discovery-and-certification pipeline using seeded propagation, synchronization on overlaps, defect-based pruning, cycle-aware taxonomy, and deduplication, and apply it to layer-16 atlases for Llama 3.2 3B Instruct, Qwen 2.5 3B Instruct, and Gemma 2 2B IT.
- Empirically, raw global-vector similarity does not reliably recover semantic identity, whereas section-based identity recovery is perfect on certified supports. The authors take this as evidence that semantic sections are a better feature ontology in obstructed regimes.
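
The distinction between tree-supported propagation and cycle consistency in the points above can be illustrated with a toy sketch. This is not the authors' pipeline: it assumes three charts, 2D feature representatives, and transport maps that are plain rotations, and the names `rotation`, `propagate`, and `holonomy_defect` are hypothetical helpers introduced only for this illustration. Along a tree, a seed can always be propagated because there is exactly one path to each chart. Once an edge closes a cycle, the transported vector must return to itself for the local representatives to glue into a global feature.

```python
import numpy as np

def rotation(theta):
    """2D rotation matrix, used here as a stand-in transport map (assumption)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def propagate(seed_chart, seed_vec, tree_edges, transports):
    """Carry a seed representative along tree edges given in parent-first order."""
    reps = {seed_chart: seed_vec}
    for parent, child in tree_edges:
        reps[child] = transports[(parent, child)] @ reps[parent]
    return reps

def holonomy_defect(cycle, transports, vec):
    """Transport vec once around a closed cycle and measure the mismatch."""
    out = vec
    for a, b in zip(cycle, cycle[1:] + cycle[:1]):
        out = transports[(a, b)] @ out
    return float(np.linalg.norm(out - vec))

seed = np.array([1.0, 0.0])
tree_edges = [(0, 1), (1, 2)]
cycle = [0, 1, 2]

# Flat case: the closing edge undoes the accumulated rotation.
flat = {(0, 1): rotation(0.3), (1, 2): rotation(0.5), (2, 0): rotation(-0.8)}
# Twisted case: the closing edge leaves a half-turn, so the vector returns negated.
twisted = {(0, 1): rotation(0.3), (1, 2): rotation(0.5), (2, 0): rotation(np.pi - 0.8)}

# Tree propagation succeeds in both cases; it never touches the closing edge.
reps_flat = propagate(0, seed, tree_edges, flat)
reps_twisted = propagate(0, seed, tree_edges, twisted)

flat_defect = holonomy_defect(cycle, flat, seed)
twisted_defect = holonomy_defect(cycle, twisted, seed)
print(f"flat defect: {flat_defect:.3e}, twisted defect: {twisted_defect:.3f}")
```

In this toy setting both transport assignments yield identical representatives on the tree, so only the cycle check separates the globalizable case from the twisted one. A defect-based pruning step could, under these assumptions, threshold a quantity like `holonomy_defect`.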

