From Fuzzy to Formal: Scaling Hospital Quality Improvement with AI

arXiv cs.AI / 4/23/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes using AI to accelerate and improve hospital Quality Improvement (QI) factor discovery, an expert-driven process that is often time-consuming, hard to reproduce, and limited in auditability.
  • It argues that QI factor discovery is inherently exploratory and “fuzzy,” so the approach must preserve iterative sense-making rather than rely on assumptions that the task is fully well-defined.
  • The authors introduce a “Human-AI Spec-Solution Co-optimization” framework that treats natural-language QI specifications as tunable components alongside prompt learning, mapping the workflow to problem formalization, model learning, and validation.
  • In a real-world deployment at an urban safety-net hospital, the resulting AI-for-QI pipelines achieved at least 70% concordance with expert annotations and outperformed prior manual Lean analyses by being more efficient, recovering known drivers, and surfacing new modifiable factors.
  • The framework also claims to produce auditable reasoning traces, addressing governance needs by making AI outputs more transparent and aligned with clinical objectives.

Abstract

Hospital Quality Improvement (QI) plays a critical role in optimizing healthcare delivery by translating high-level hospital goals into actionable solutions. A critical step of QI is to identify the key modifiable contributing factors, a process we call QI factor discovery, typically through expert-driven semi-structured qualitative tools like fishbone diagrams, chart reviews, and Lean Healthcare methods. AI has the potential to transform and accelerate QI factor discovery, which is traditionally time- and resource-intensive and limited in reproducibility and auditability. Nevertheless, current AI alignment methods assume the task is well-defined, whereas QI factor discovery is an exploratory, fuzzy, and iterative sense-making process that relies on complex implicit expert judgments. To design an AI pipeline that formalizes the QI process while preserving its exploratory components, we propose viewing the task as learning not only LLM prompts but also the overarching natural-language specifications. In particular, we map QI factor discovery to steps of the classical AI/ML development process (problem formalization, model learning, and model validation) where the specifications are tunable hyperparameters. Domain experts and AI agents iteratively refine both the overarching specifications and AI pipeline until AI extractions are concordant with expert annotations and aligned with clinical objectives. We applied this "Human-AI Spec-Solution Co-optimization" framework at an urban safety-net hospital to identify factors driving prolonged length of stay and unplanned 30-day readmissions. The resulting AI-for-QI pipelines achieved \ge 70\% concordance with expert annotations. Compared to prior manual Lean analyses, the AI pipeline was substantially more efficient, recovered previous findings, surfaced new modifiable factors, and produced auditable reasoning traces.