LLM StructCore: Schema-Guided Reasoning Condensation and Deterministic Compilation
arXiv cs.CL / 4/23/2026
📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research
Key Points
- The paper introduces LLM StructCore, a contract-driven two-stage system for filling Dyspnea clinical case report forms with a strict 134-item output schema.
- Instead of predicting all fields at once, Stage 1 generates a stable Schema-Guided Reasoning (SGR)-style JSON summary containing exactly nine domain keys.
- Stage 2 is a deterministic, 0-LLM “compiler” that parses Stage 1 output, canonicalizes item names, normalizes to the official controlled vocabulary, gates evidence to reduce false positives, and expands predictions to the full 134-item format.
- Experiments for CL4Health 2026 show strong results (dev80 macro-F1 up to 0.6543 EN and 0.6905 IT, with an English Codabench hidden score of 0.63) and demonstrate language-agnostic performance between English and Italian.
- The approach is motivated by the extreme sparsity of known fields and scoring penalties for both empty values and unsupported predictions, emphasizing precision through schema constraints and deterministic post-processing.
Related Articles

Black Hat USA
AI Business

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

Elevating Austria: Google invests in its first data center in the Alps.
Google Blog

10 AI Tools Every Developer Should Try in 2026
Dev.to