SymptomWise: A Deterministic Reasoning Layer for Reliable and Efficient AI Systems

arXiv cs.AI / 4/10/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces SymptomWise, a framework aimed at improving reliability, interpretability, and reducing hallucinations in AI symptom analysis for safety-critical contexts.
  • SymptomWise separates language understanding from diagnostic reasoning by using curated medical knowledge plus a deterministic, codex-driven inference module over a finite hypothesis space.
  • Large language models are constrained to symptom extraction and optional explanations, while the actual diagnostic inference is performed deterministically to improve traceability.
  • In preliminary testing on 42 challenging pediatric neurology cases, the correct diagnosis appeared in the top five differentials in 88% of cases, with substantial agreement with clinician consensus.
  • The authors argue the approach generalizes beyond medicine as a deterministic structuring/routing layer for foundation models in other abductive reasoning tasks, potentially lowering unnecessary computation.

Abstract

AI-driven symptom analysis systems face persistent challenges in reliability, interpretability, and hallucination. End-to-end generative approaches often lack traceability and may produce unsupported or inconsistent diagnostic outputs in safety-critical settings. We present SymptomWise, a framework that separates language understanding from diagnostic reasoning. The system combines expert-curated medical knowledge, deterministic codex-driven inference, and constrained use of large language models. Free-text input is mapped to validated symptom representations, then evaluated by a deterministic reasoning module operating over a finite hypothesis space to produce a ranked differential diagnosis. Language models are used only for symptom extraction and optional explanation, not for diagnostic inference. This architecture improves traceability, reduces unsupported conclusions, and enables modular evaluation of system components. Preliminary evaluation on 42 expert-authored challenging pediatric neurology cases shows meaningful overlap with clinician consensus, with the correct diagnosis appearing in the top five differentials in 88% of cases. Beyond medicine, the framework generalizes to other abductive reasoning domains and may serve as a deterministic structuring and routing layer for foundation models, improving precision and potentially reducing unnecessary computational overhead in bounded tasks.