Disentangle-then-Refine: LLM-Guided Decoupling and Structure-Aware Refinement for Graph Contrastive Learning

arXiv cs.AI / 4/17/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • Traditional graph contrastive learning for text-attributed graphs uses random stochastic augmentations that can mix task-relevant signals with noise.
  • The proposed SDM-SCR framework uses an LLM-driven Semantic Decoupling Module to transform raw attributes into separate asymmetric views of semantic signal versus noise.
  • A Semantic Consistency Regularization step applies a spectral, structure-aware selective filter that enforces consistency only in the signal subspace while suppressing high-frequency noise.
  • The “Disentangle-then-Refine” design aims to purify semantic signals and reduce issues such as LLM hallucinations without causing harmful over-smoothing.
  • Experiments reported for SDM-SCR indicate state-of-the-art performance, improving both accuracy and efficiency over prior approaches.

Abstract

Conventional Graph Contrastive Learning (GCL) on Text-Attributed Graphs (TAGs) relies on blind stochastic augmentations, inadvertently entangling task-relevant signals with noise. We propose SDM-SCR, a robust framework anchored in Approximate Orthogonal Decomposition. First, the Semantic Decoupling Module (SDM) leverages the instruction-following capability of Large Language Models (LLMs) to actively parse raw attributes into asymmetric, task-oriented signal and noise views. This shifts the paradigm from random perturbation to semantic-aware disentanglement. Subsequently, Semantic Consistency Regularization (SCR) exploits the spectral observation that semantic signals are topologically smooth while residual noise is high-frequency. SCR functions as a selective spectral filter, enforcing consistency only on the signal subspace to eliminate LLM hallucinations without over-smoothing. This ``Disentangle-then-Refine'' mechanism ensures rigorous signal purification. Extensive experiments demonstrate that SDM-SCR achieves SOTA performance in accuracy and efficiency.