AI Navigate

強化学習とLLMベースの品質保証による解釈可能なマルコフモデルを用いた時空間リスクサーフェスによる行方不明児捜索計画

arXiv cs.AI / 2026/3/11

Ideas & Deep AnalysisTools & Practical UsageModels & Research

要点

  • Guardianシステムは、行方不明児捜査を支援するために設計されており、非構造化された事例データを構造化された時空間リスクサーフェスに変換し、重要な最初の72時間内の捜索計画に活用します。
  • このシステムの3層からなる予測モデルには、道路のアクセス可能性や時間帯の変動などを考慮した解釈可能なマルコフ連鎖が含まれ、確率的な位置予測を生成します。
  • 2層目では強化学習を用いてマルコフ連鎖の出力を実用的な捜索計画に変換し、その後、大規模言語モデル(LLM)による品質保証を経て展開されます。
  • 合成的かつ現実的な事例研究により、システムが解釈可能で実行可能な捜索計画を提供できること、また感度や潜在的な失敗モードを明らかにしています。
  • このアプローチは、データ処理、確率モデリング、AI支援の検証を統合した包括的な意思決定支援ツールを法執行機関に提供し、より効果的な行方不明児捜索活動を可能にします。

Computer Science > Artificial Intelligence

arXiv:2603.08933 (cs)
[Submitted on 9 Mar 2026]

Title:Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance

View a PDF of the paper titled Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance, by Joshua Castillo and Ravi Mukkamala
View PDF HTML (experimental)
Abstract:The first 72 hours of a missing-child investigation are critical for successful recovery. However, law enforcement agencies often face fragmented, unstructured data and a lack of dynamic, geospatial predictive tools. Our system, Guardian, provides an end-to-end decision-support system for missing-child investigation and early search planning. It converts heterogeneous, unstructured case documents into a schema-aligned spatiotemporal representation, enriches cases with geocoding and transportation context, and provides probabilistic search products spanning 0-72 hours. In this paper, we present an overview of Guardian as well as a detailed description of a three-layer predictive component of the system. The first layer is a Markov chain, a sparse, interpretable model with transitions incorporating road accessibility costs, seclusion preferences, and corridor bias with separate day/night parameterizations. The Markov chain's output prediction distributions are then transformed into operationally useful search plans by the second layer's reinforcement learning. Finally, the third layer's LLM performs post hoc validation of layer 2 search plans prior to their release. Using a synthetic but realistic case study, we report quantitative outputs across 24/48/72-hour horizons and analyze sensitivity, failure modes, and tradeoffs. Results show that the proposed predictive system with the three-layer architecture produces interpretable priors for zone optimization and human review.
Comments:
Subjects: Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as: arXiv:2603.08933 [cs.AI]
  (or arXiv:2603.08933v1 [cs.AI] for this version)
  https://doi.org/10.48550/arXiv.2603.08933
Focus to learn more
arXiv-issued DOI via DataCite

Submission history

From: Joshua Castillo [view email]
[v1] Mon, 9 Mar 2026 21:08:29 UTC (14,788 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance, by Joshua Castillo and Ravi Mukkamala
  • View PDF
  • HTML (experimental)
  • TeX Source
Current browse context:
cs.AI
< prev   |   next >
Change to browse by:

References & Citations

export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo
Bibliographic Tools

Bibliographic and Citation Tools

Bibliographic Explorer Toggle
Bibliographic Explorer (What is the Explorer?)
Connected Papers Toggle
Connected Papers (What is Connected Papers?)
Litmaps Toggle
Litmaps (What is Litmaps?)
scite.ai Toggle
scite Smart Citations (What are Smart Citations?)
Code, Data, Media

Code, Data and Media Associated with this Article

alphaXiv Toggle
alphaXiv (What is alphaXiv?)
Links to Code Toggle
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub Toggle
DagsHub (What is DagsHub?)
GotitPub Toggle
Gotit.pub (What is GotitPub?)
Huggingface Toggle
Hugging Face (What is Huggingface?)
Links to Code Toggle
Papers with Code (What is Papers with Code?)
ScienceCast Toggle
ScienceCast (What is ScienceCast?)
Demos

Demos

Replicate Toggle
Replicate (What is Replicate?)
Spaces Toggle
Hugging Face Spaces (What is Spaces?)
Spaces Toggle
TXYZ.AI (What is TXYZ.AI?)
Related Papers

Recommenders and Search Tools

Link to Influence Flower
Influence Flower (What are Influence Flowers?)
Core recommender toggle
CORE Recommender (What is CORE?)
About arXivLabs

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.