AI Navigate

Discovering Decoupled Functional Modules in Large Language Models

arXiv cs.LG / 3/19/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes the ULCMOD framework, an unsupervised cross-layer module discovery method that disentangles the entire LLM neuron set into functional modules and associates modules with topics of input samples.
  • It introduces a novel objective function and an efficient IterD (Iterative Decoupling) algorithm to perform this discovery.
  • Extensive experiments show the discovered modules are disentangled, semantically meaningful, and exhibit interpretable specializations with clear spatial and hierarchical organization within the model.
  • The work aims to advance interpretability and trustworthiness of LLMs by providing a new tool for analyzing functional modularity.

Abstract

Understanding the internal functional organization of Large Language Models (LLMs) is crucial for improving their trustworthiness and performance. However, how LLMs organize different functions into modules remains highly unexplored. To bridge this gap, we formulate a functional module discovery problem and propose an Unsupervised LLM Cross-layer MOdule Discovery (ULCMOD) framework that simultaneously disentangles the large set of neurons in the entire LLM into modules while discovering the topics of input samples related to these modules. Our framework introduces a novel objective function and an efficient Iterative Decoupling (IterD) algorithm. Extensive experiments show that our method discovers high-quality, disentangled modules that capture more meaningful semantic information and achieve superior performance in various downstream tasks. Moreover, our qualitative analysis reveals that the discovered modules show semantic coherence, correspond to interpretable specializations, and a clear spatial and hierarchical organization within the LLM. Our work provides a novel tool for interpreting the functional modules of LLMs, filling a critical blank in LLM's interpretability research.