Knowledge Boundary Discovery for Large Language Models
arXiv cs.AI / 2026/3/24
💬 オピニオンIdeas & Deep AnalysisModels & Research
要点
- The paper introduces Knowledge Boundary Discovery (KBD), a reinforcement learning framework that maps where an LLM can and cannot answer questions with confidence.
- KBD distinguishes between an “within-knowledge boundary” set of answerable questions and a “beyond-knowledge boundary” set of unanswerable ones by iteratively probing the model.
- To address hallucinations, it treats questioning as an agent interacting with a partially observable environment, using entropy reduction as the reward signal.
- The method incrementally builds belief states from the LLM’s responses and generates a set of non-trivial answerable/unanswerable questions.
- Validation against manually crafted benchmark datasets finds the automatically generated question sets are comparable to human-created evaluations, suggesting KBD as a new LLM evaluation direction.

