SC-Taxo: Hierarchical Taxonomy Generation under Semantic Consistency Constraints using Large Language Models

arXiv cs.CL / 5/4/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that automatic scientific taxonomy generation is hindered by structural inconsistencies and semantic misalignment between different hierarchy levels.
  • It identifies the root cause as insufficient modeling of hierarchical semantic consistency and proposes SC-Taxo to address this gap.
  • SC-Taxo uses large language models with hierarchy-aware refinement stages, including bidirectional heading generation that combines bottom-up abstraction with top-down semantic constraints.
  • It also models peer-level (horizontal) semantic dependencies to improve consistency across sections at the same hierarchy depth.
  • Experiments on multiple benchmarks show improved hierarchy alignment and heading quality, and additional testing on Chinese literature confirms robust cross-lingual generalization.

Abstract

Scientific literature is expanding at an unprecedented pace, making it increasingly challenging to efficiently organize and access domain knowledge. A high-quality scientific taxonomy offers a structured and hierarchical representation of a research field, facilitating literature exploration and topic navigation, as well as enabling downstream applications such as trend analysis, idea generation, and information retrieval. However, existing taxonomy generation approaches often suffer from structural inconsistencies and semantic misalignment across hierarchical levels. Through empirical analysis, we find that these issues largely stem from inadequate modeling of hierarchical semantic consistency. To address this limitation, we propose a semantic-consistent taxonomy generation (SC-Taxo) framework that leverages large language models (LLMs) with hierarchy-aware refinement stages to ensure semantic consistency. Specifically, SC-Taxo introduces a bidirectional heading generation mechanism that jointly performs bottom-up abstraction and top-down semantic constraint, while further capturing peer-level semantic dependencies to enhance horizontal consistency. Experiments on multiple benchmark datasets demonstrate consistent improvements in hierarchy alignment and heading quality, and additional evaluation on Chinese scientific literature validates its robust cross-lingual generalization.