LLM-Enhanced Energy Contrastive Learning for Out-of-Distribution Detection in Text-Attributed Graphs

arXiv cs.AI / 3/24/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper targets node-level out-of-distribution (OOD) detection in text-attributed graphs, where training/test distribution mismatch can severely degrade node classification performance.
  • It introduces LECT (LLM-Enhanced Energy Contrastive Learning), combining large language models (LLMs) with energy-based contrastive learning to separate in-distribution (IND) from OOD nodes.
  • LECT generates dependency-aware pseudo-OOD samples using LLM semantic understanding and contextual knowledge, enabling higher-quality OOD augmentation.
  • Experiments across six benchmark datasets show LECT consistently outperforms existing state-of-the-art baselines while maintaining both high classification accuracy and robust OOD detection.

Abstract

Text-attributed graphs, where nodes are enriched with textual attributes, have become a powerful tool for modeling real-world networks such as citation, social, and transaction networks. However, existing methods for learning from these graphs often assume that the distributions of training and testing data are consistent. This assumption leads to significant performance degradation when faced with out-of-distribution (OOD) data. In this paper, we address the challenge of node-level OOD detection in text-attributed graphs, with the goal of maintaining accurate node classification while simultaneously identifying OOD nodes. We propose a novel approach, LLM-Enhanced Energy Contrastive Learning for Out-of-Distribution Detection in Text-Attributed Graphs (LECT), which integrates large language models (LLMs) and energy-based contrastive learning. The proposed method involves generating high-quality OOD samples by leveraging the semantic understanding and contextual knowledge of LLMs to create dependency-aware pseudo-OOD nodes, and applying contrastive learning based on energy functions to distinguish between in-distribution (IND) and OOD nodes. The effectiveness of our method is demonstrated through extensive experiments on six benchmark datasets, where our method consistently outperforms state-of-the-art baselines, achieving both high classification accuracy and robust OOD detection capabilities.