CogRAG+: Cognitive-Level Guided Diagnosis and Remediation of Memory and Reasoning Deficiencies in Professional Exam QA

arXiv cs.CL / 4/30/2026

📰 NewsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper introduces CogRAG+, a training-free framework designed to separate and better align retrieval-augmented generation with human cognitive hierarchies for professional exam-style QA.
It proposes “Reinforced Retrieval,” a judge-driven dual-path approach (fact-centric and option-centric) to improve retrieval quality and prevent cascading failures from missing foundational knowledge.
It also introduces “cognition-stratified Constrained Reasoning,” replacing unconstrained chain-of-thought generation with structured templates to reduce logical inconsistency and redundant generation.
Experiments on Qwen3-8B and Llama3.1-8B show consistent gains over general-purpose models and standard RAG on the Registered Dietitian qualification exam, reaching 85.8% accuracy for Qwen3-8B and 60.3% for Llama3.1-8B in single-question mode.
The method further lowers the unanswered rate from 7.6% to 1.4%, suggesting improved reliability on specialized professional tasks.

Abstract

Professional domain knowledge underpins human civilization, serving as both the basis for industry entry and the core of complex decision-making and problem-solving. However, existing large language models often suffer from opaque inference processes in which retrieval and reasoning are tightly entangled, causing knowledge gaps and reasoning inconsistencies in professional tasks. To address this, we propose CogRAG+, a training-free framework that decouples and aligns the retrieval-augmented generation pipeline with human cognitive hierarchies. First, we introduce Reinforced Retrieval, a judge-driven dual-path strategy with fact-centric and option-centric paths that strengthens retrieval and mitigates cascading failures caused by missing foundational knowledge. We then develop cognition-stratified Constrained Reasoning, which replaces unconstrained chain-of-thought generation with structured templates to reduce logical inconsistency and generative redundancy. Experiments on two representative models, Qwen3-8B and Llama3.1-8B, show that CogRAG+ consistently outperforms general-purpose models and standard RAG methods on the Registered Dietitian qualification exam. In single-question mode, it raises overall accuracy to 85.8\% for Qwen3-8B and 60.3\% for Llama3.1-8B, with clear gains over vanilla baselines. Constrained Reasoning also reduces the unanswered rate from 7.6\% to 1.4\%. CogRAG+ offers a robust, model-agnostic path toward training-free expert-level performance in specialized domains.