LINE: LLM-based Iterative Neuron Explanations for Vision Models

arXiv cs.CV / 4/10/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces LINE, a training-free, iterative method for labeling and explaining neuron-level concepts in vision models using an open-vocabulary approach.
  • LINE operates in a strict black-box setting by using an LLM and a text-to-image generator in a closed loop, with proposals guided by the neuron activation history.
  • Experiments report state-of-the-art results across multiple architectures, including AUC gains of up to 0.18 on ImageNet and 0.05 on Places365.
  • The method reportedly discovers new concepts not covered by large predefined vocabularies, finding on average 29% new concepts that those vocabularies miss.
  • LINE also outputs a full generation history and visual explanations, enabling analyses such as polysemanticity evaluation and comparisons to gradient-dependent activation maximization approaches.

Abstract

Interpreting the concepts encoded by individual neurons in deep neural networks is a crucial step towards understanding their complex decision-making processes and ensuring AI safety. Despite recent progress in neuron labeling, existing methods often limit the search space to predefined concept vocabularies or produce overly specific descriptions that fail to capture higher-order, global concepts. We introduce LINE, a novel, training-free iterative approach tailored for open-vocabulary concept labeling in vision models. Operating in a strictly black-box setting, LINE leverages a large language model and a text-to-image generator to iteratively propose and refine concepts in a closed loop, guided by activation history. We demonstrate that LINE achieves state-of-the-art performance across multiple model architectures, yielding AUC improvements of up to 0.18 on ImageNet and 0.05 on Places365, while discovering, on average, 29% of new concepts missed by massive predefined vocabularies. Beyond identifying the top concept, LINE provides a complete generation history, which enables polysemanticity evaluation and produces supporting visual explanations that rival gradient-dependent activation maximization methods.