Beyond Predefined Schemas: TRACE-KG for Context-Enriched Knowledge Graphs from Complex Documents

arXiv cs.AI / 4/7/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper highlights a key limitation of existing knowledge graph (KG) construction approaches: ontology-driven pipelines require expensive schema maintenance, while schema-free extraction can yield fragmented graphs with weak global organization in long, context-heavy documents.
  • It proposes TRACE-KG, a multimodal framework that jointly induces a context-enriched KG and an induced schema without relying on any predefined ontology.
  • TRACE-KG aims to model conditional/context-dependent information using structured qualifiers and to organize entities and relations via a data-driven, reusable semantic scaffold.
  • The approach emphasizes end-to-end traceability, preserving links from KG elements back to the original source evidence.
  • Experimental results (as reported in the abstract) suggest TRACE-KG produces more structurally coherent, traceable KGs and provides a practical alternative to both ontology-driven and purely schema-free pipelines.

Abstract

Knowledge graph construction typically relies either on predefined ontologies or on schema-free extraction. Ontology-driven pipelines enforce consistent typing but require costly schema design and maintenance, whereas schema-free methods often produce fragmented graphs with weak global organization, especially in long technical documents with dense, context-dependent information. We propose TRACE-KG (Text-dRiven schemA for Context-Enriched Knowledge Graphs), a multimodal framework that jointly constructs a context-enriched knowledge graph and an induced schema without assuming a predefined ontology. TRACE-KG captures conditional relations through structured qualifiers and organizes entities and relations using a data-driven schema that serves as a reusable semantic scaffold while preserving full traceability to the source evidence. Experiments show that TRACE-KG produces structurally coherent, traceable knowledge graphs and offers a practical alternative to both ontology-driven and schema-free construction pipelines.