Exploring Concept Subspace for Self-explainable Text-Attributed Graph Learning

arXiv cs.LG / 4/15/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • This paper proposes Graph Concept Bottleneck (GCB), a new paradigm for self-explainable learning on text-attributed graphs by mapping graphs into a concept bottleneck space of meaningful phrase-level concepts.
  • Predictions are driven by the activation of these concepts, offering a form of interpretability that differs from prior interpretable approaches that primarily use explanatory subgraphs.
  • The authors refine the concept space using the information bottleneck principle to keep only the most relevant concepts, producing explanations that are both more concise and more faithful.
  • Empirical results indicate GCB achieves “intrinsic interpretability” with accuracy comparable to black-box graph neural networks.
  • GCB also shows improved robustness and generalizability, performing better under distribution shifts and data perturbations due to concept-guided prediction.

Abstract

We introduce Graph Concept Bottleneck (GCB) as a new paradigm for self-explainable text-attributed graph learning. GCB maps graphs into a subspace, concept bottleneck, where each concept is a meaningful phrase, and predictions are made based on the activation of these concepts. Unlike existing interpretable graph learning methods that primarily rely on subgraphs as explanations, the concept bottleneck provides a new form of interpretation. To refine the concept space, we apply the information bottleneck principle to focus on the most relevant concepts. This not only yields more concise and faithful explanations but also explicitly guides the model to "think" toward the correct decision. We empirically show that GCB achieves intrinsic interpretability with accuracy on par with black-box Graph Neural Networks. Moreover, it delivers better performance under distribution shifts and data perturbations, showing improved robustness and generalizability, benefitting from concept-guided prediction.