Graph self-supervised learning based on frequency corruption

arXiv cs.LG / 4/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes FC-GSSL, a frequency-aware graph self-supervised learning method that creates “corrupted” graphs biased toward high-frequency signals.
  • Instead of only learning from corrupted inputs, FC-GSSL uses an autoencoder to reconstruct low-frequency and general features as supervision targets, encouraging fusion across multiple frequency bands.
  • The method includes multiple sampling strategies to generate diverse corrupted graphs, combining results via intersections and unions to produce different training views.
  • By aligning node representations across these views, FC-GSSL aims to find effective frequency combinations while reducing overreliance on specific local high-frequency patterns.
  • Experiments on 14 datasets across node classification, graph prediction, and transfer learning show consistent improvements in both performance and generalization compared with prior approaches.

Abstract

Graph self-supervised learning can reduce the need for labeled graph data and has been widely used in recommendation, social networks, and other web applications. However, existing methods often underuse high-frequency signals and may overfit to specific local patterns, which limits representation quality and generalization. We propose Frequency-Corrupt Based Graph Self-Supervised Learning (FC-GSSL), a method that builds corrupted graphs biased toward high-frequency information by corrupting nodes and edges according to their low-frequency contributions. These corrupted graphs are used as inputs to an autoencoder, while low-frequency and general features are reconstructed as supervision targets, forcing the model to fuse information from multiple frequency bands. We further design multiple sampling strategies and generate diverse corrupted graphs from the intersections and unions of the sampling results. By aligning node representations from these views, the model can discover useful frequency combinations, reduce reliance on specific high-frequency components, and improve robustness. Experiments on 14 datasets across node classification, graph prediction, and transfer learning show that FC-GSSL consistently improves performance and generalization.