Revisiting Catastrophic Forgetting in Continual Knowledge Graph Embedding

arXiv cs.LG / 4/22/2026

📰 NewsModels & Research

Key Points

  • The paper studies continual knowledge graph embedding (CKGE), focusing on why models can suffer “catastrophic forgetting” as knowledge graphs evolve over time.
  • It argues that existing CKGE approaches and evaluations are incomplete because they largely ignore a separate failure mode called “entity interference,” where embeddings of newly added entities disrupt previously learned ones.
  • The authors propose a corrected CKGE evaluation protocol that explicitly accounts for entity interference, aiming to make catastrophic forgetting assessments more reliable.
  • Experiments across multiple benchmarks show that ignoring entity interference can overestimate CKGE performance by as much as 25%, especially when the number of entities grows significantly.
  • The work further dissects which CKGE methods and knowledge graph embedding (KGE) models are affected by different forgetting sources and introduces a CKGE-specific catastrophic forgetting metric.

Abstract

Knowledge Graph Embeddings (KGEs) support a wide range of downstream tasks over Knowledge Graphs (KGs). In practice, KGs evolve as new entities and facts are added, motivating Continual Knowledge Graph Embedding (CKGE) methods that update embeddings over time. Current CKGE approaches address catastrophic forgetting (i.e., the performance degradation on previously learned tasks) primarily by limiting changes to existing embeddings. However, we show that this view is incomplete. When new entities are introduced, their embeddings can interfere with previously learned ones, causing the model to predict them in place of previously correct answers. This phenomenon, which we call entity interference, has been largely overlooked and is not accounted for in current CKGE evaluation protocols. As a result, the assessment of catastrophic forgetting becomes misleading, and CKGE methods performance is systematically overestimated. To address this issue, we introduce a corrected CKGE evaluation protocol that accounts for entity interference. Through experiments on multiple benchmarks, we show that ignoring this effect can lead to performance overestimation of up to 25%, particularly in scenarios with significant entity growth. We further analyze how different CKGE methods and KGE models are affected by the different sources of forgetting, and introduce a catastrophic forgetting metric tailored to CKGE.