DQA: Diagnostic Question Answering for IT Support

arXiv cs.CL / 4/8/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces DQA (Diagnostic Question Answering), a framework designed for enterprise IT support dialogues where resolving issues requires iterative evidence gathering to pinpoint root causes.
  • Unlike standard multi-turn RAG, DQA maintains a persistent diagnostic state and aggregates retrieved cases by root cause to better accumulate evidence across turns and manage competing hypotheses.
  • DQA uses conversational query rewriting, retrieval aggregation, and state-conditioned response generation to produce systematic troubleshooting responses within enterprise latency and context constraints.
  • In evaluations on 150 anonymized enterprise IT support scenarios using a replay-based protocol, DQA achieves a 78.7% success rate versus 41.3% for a multi-turn RAG baseline, while cutting average turns from 8.4 to 3.9.

Abstract

Enterprise IT support interactions are fundamentally diagnostic: effective resolution requires iterative evidence gathering from ambiguous user reports to identify an underlying root cause. While retrieval-augmented generation (RAG) provides grounding through historical cases, standard multi-turn RAG systems lack explicit diagnostic state and therefore struggle to accumulate evidence and resolve competing hypotheses across turns. We introduce DQA, a diagnostic question-answering framework that maintains persistent diagnostic state and aggregates retrieved cases at the level of root causes rather than individual documents. DQA combines conversational query rewriting, retrieval aggregation, and state-conditioned response generation to support systematic troubleshooting under enterprise latency and context constraints. We evaluate DQA on 150 anonymized enterprise IT support scenarios using a replay-based protocol. Averaged over three independent runs, DQA achieves a 78.7% success rate under a trajectory-level success criterion, compared to 41.3% for a multi-turn RAG baseline, while reducing average turns from 8.4 to 3.9.