Agentic AI for Substance Use Education: Integrating Regulatory and Scientific Knowledge Sources

arXiv cs.CL / 5/4/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper addresses limitations of traditional substance-use education—especially scalability, personalization, and keeping information up to date—and proposes real-time AI support as an underexplored solution.
  • Researchers built an agentic AI web application that integrates DEA records with peer-reviewed scientific literature using retrieval-augmented generation and dynamically issued PubMed queries.
  • The system uses a semantically chunked and vector-indexed document corpus to retrieve relevant regulatory and scientific context, enabling transparent, context-sensitive educational responses.
  • An expert evaluation with five domain experts and independent raters found high-quality outputs, with mean ratings between 4.18 and 4.35 across factual accuracy, citation quality, contextual coherence, and regulatory appropriateness, plus strong inter-rater agreement (Cohen’s kappa = 0.78).
  • The results suggest that agentic AI architectures combining authoritative regulatory sources with up-to-date research can deliver scalable, accurate, and verifiable substance-use education, though further longitudinal studies are needed.

Abstract

The delivery of traditional substance education has remained problematic due to challenges in scalability, personalization, and the currency of information in a rapidly evolving substance use landscape. While artificial intelligence (AI) offers a promising frontier for enhancing educational delivery, its application in providing real-time, authoritative substance use education remains largely underexplored. We built an agentic-based AI web application that combined Drug Enforcement Administration records with peer-reviewed literature in real-time to provide transparent context-sensitive substance use education. The system uses retrieval-augmented generation with a carefully filtered corpus of 102 documents and dynamic PubMed queries. Document storage was semantically chunked and placed in a vector representation in order to be easily retrieved. We conducted an expert evaluation study in which a panel of five subject matter experts generated 30 domain-specific questions, and two independent raters assessed 90 system interactions (30 primary questions plus two contextual follow-ups each) using a five-point Likert scale across four criteria: factual accuracy, citation quality, contextual coherence, and regulatory appropriateness. Mean ratings ranged from 4.18 to 4.35 across the four criteria (overall category range: 4.05-4.52), with substantial inter-rater agreement (Cohen's kappa = 0.78). These findings suggest that agentic AI architectures integrating authoritative regulatory sources with real-time scientific literature represent a promising direction for scalable, accurate, and verifiable health education delivery, warranting further evaluation through longitudinal user studies.