SC-Taxo: Hierarchical Taxonomy Generation under Semantic Consistency Constraints using Large Language Models
arXiv cs.CL / 5/4/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that automatic scientific taxonomy generation is hindered by structural inconsistencies and semantic misalignment between different hierarchy levels.
- It identifies the root cause as insufficient modeling of hierarchical semantic consistency and proposes SC-Taxo to address this gap.
- SC-Taxo uses large language models with hierarchy-aware refinement stages, including bidirectional heading generation that combines bottom-up abstraction with top-down semantic constraints.
- It also models peer-level (horizontal) semantic dependencies to improve consistency across sections at the same hierarchy depth.
- Experiments on multiple benchmarks show improved hierarchy alignment and heading quality, and additional testing on Chinese literature confirms robust cross-lingual generalization.
Related Articles
A very basic litmus test for LLMs "ok give me a python program that reads my c: and put names and folders in a sorted list from biggest to small"
Reddit r/LocalLLaMA

ALM on Power Platform: ADO + GitHub, the best of both worlds
Dev.to

Experiment: Does repeated usage influence ChatGPT 5.4 outputs in a RAG-like setup?
Dev.to

Find 12 high-volume, low-competition GEO content topics Topify.ai should rank on
Dev.to

When a memorized rule fits your bug too well: a meta-trap of agent workflows
Dev.to