Learning-to-Explain through 20Q Gaming: An Explainable Recommender for Cybersecurity Education

arXiv cs.LG / 5/1/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces an explainable AI–based educational framework called the Q20 cybersecurity game to improve the interactivity and adaptability of cybersecurity training.
  • It proposes the Explainable Q20 Cybersecurity Recommender (EQ-20CR), a policy-based reinforcement-learning agent that asks a targeted sequence of questions and uses the answers to justify defensive actions.
  • The system frames “Why should I execute this mitigation?” as a 20 questions (Q20) dialogue, producing both an optimal recommendation and a concise explanation trace.
  • The framework is designed to adapt question informativeness and difficulty over time, gradually helping learners recognize and articulate cybersecurity concepts, attack vectors, or defense strategies.
  • The authors present the architecture and demonstrate its potential via case studies across multiple cybersecurity concepts, aiming to enhance training effectiveness and awareness.

Abstract

The growing sophistication of contemporary cyber threats necessitates a more effective and adaptive approach to cybersecurity training. Intuitive and adaptive approaches to learning, which are often required, are not provided in traditional learning methods. In this article, we present a new educational framework, "Learning to Explain Cybersecurity with Q20 Game", based on explainable AI (XAI), an educational game to enhance interactivity in learning. We propose a novel, game-inspired framework - the Explainable Q20 Cybersecurity Recommender (EQ-20CR), that learns to elicit the minimal set of evidential facts needed to justify cybersecurity defensive action. By casting "Why should I execute this mitigation?" as a 20 questions (Q20) game, a policy-based reinforcement-learning (RL) agent actively queries an environment until it can both (i) recommend the optimal security education and (ii) explain that decision with a concise dialogue trace. The article draws from "Playing 20 Question Game with Policy-Based Reinforcement Learning" [1] and "Learning-to-Explain: Recommendation Reason Determination through Q20 Gaming" [2]. The framework uses a policy-based reinforcement learning (RL) agent that leads the user through a sequence of questions to recognize and articulate a targeted cybersecurity concept, attack vector, or defense strategy. Furthermore, users are gradually exposed to informative questions by the system, revealing complicated, structured way at an adaptive difficulty level. In this paper, we design the architecture, its application to various concepts of cybersecurity through illustrative case studies, and its transformative potential on the training and awareness of cybersecurity recommendations.