Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance

arXiv cs.AI / 2026/3/11

Key Points

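The Guardian system is designed to support missing-child investigations. It converts unstructured case data into structured spatiotemporal risk surfaces that inform search planning during the critical first 72 hours.
The system's three-layer predictive component starts with an interpretable Markov chain. Its transitions account for factors such as road accessibility and day/night variation, and it produces probabilistic location predictions.
The second layer uses reinforcement learning to turn the Markov chain's output into operationally useful search plans. A third layer then applies large language model (LLM) quality assurance to those plans before they are released.
A synthetic but realistic case study shows that the system can produce interpretable, actionable search plans, and it exposes sensitivities and potential failure modes.
The approach combines data processing, probabilistic modeling, and AI-assisted validation into an end-to-end decision-support tool intended to help law enforcement plan missing-child searches.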
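The first layer described above can be illustrated with a minimal sketch of how a Markov chain turns a last-known location into risk surfaces at several horizons. Everything concrete here is an assumption made for illustration and is not taken from the paper: the four-cell world, the weights in `P_DAY` and `P_NIGHT`, the hourly time step, and the 06:00 to 18:00 day window.

```python
import numpy as np

def normalize_rows(weights):
    """Turn non-negative weights into a row-stochastic transition matrix."""
    weights = np.asarray(weights, dtype=float)
    return weights / weights.sum(axis=1, keepdims=True)

# Hypothetical 4-cell world. All weights are illustrative, not from the paper.
P_DAY = normalize_rows([
    [2, 3, 1, 0],
    [1, 2, 3, 1],
    [0, 1, 2, 3],
    [0, 0, 1, 4],
])
P_NIGHT = normalize_rows([
    [5, 1, 0, 0],
    [1, 5, 1, 0],
    [0, 1, 5, 1],
    [0, 0, 1, 5],
])

def is_daytime(hour_of_day):
    """Assumed day window of 06:00 to 18:00."""
    return 6 <= hour_of_day % 24 < 18

def risk_surfaces(initial, start_hour, horizons=(24, 48, 72)):
    """Propagate a location distribution hour by hour.

    Returns a dict mapping each horizon (in hours) to the distribution
    over cells at that time.
    """
    dist = np.asarray(initial, dtype=float)
    snapshots = {}
    for step in range(1, max(horizons) + 1):
        clock = start_hour + step - 1          # hour at which this move starts
        matrix = P_DAY if is_daytime(clock) else P_NIGHT
        dist = dist @ matrix                   # one Markov step
        if step in horizons:
            snapshots[step] = dist.copy()
    return snapshots

# Child last seen in cell 0 at 15:00.
surfaces = risk_surfaces([1.0, 0.0, 0.0, 0.0], start_hour=15)
for hours, dist in surfaces.items():
    print(hours, np.round(dist, 3))
```

Because every step is a plain matrix product, each probability in a snapshot can be traced back to specific transition weights, which is the sense in which such a model is interpretable.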

arXiv:2603.08933 (cs)

[Submitted on 9 Mar 2026]

Title: Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance

Authors: Joshua Castillo, Ravi Mukkamala

Abstract: The first 72 hours of a missing-child investigation are critical for successful recovery. However, law enforcement agencies often face fragmented, unstructured data and a lack of dynamic, geospatial predictive tools. Our system, Guardian, provides an end-to-end decision-support system for missing-child investigation and early search planning. It converts heterogeneous, unstructured case documents into a schema-aligned spatiotemporal representation, enriches cases with geocoding and transportation context, and provides probabilistic search products spanning 0-72 hours. In this paper, we present an overview of Guardian as well as a detailed description of a three-layer predictive component of the system. The first layer is a Markov chain, a sparse, interpretable model with transitions incorporating road accessibility costs, seclusion preferences, and corridor bias with separate day/night parameterizations. The Markov chain's output prediction distributions are then transformed into operationally useful search plans by the second layer's reinforcement learning. Finally, the third layer's LLM performs post hoc validation of layer 2 search plans prior to their release. Using a synthetic but realistic case study, we report quantitative outputs across 24/48/72-hour horizons and analyze sensitivity, failure modes, and tradeoffs. Results show that the proposed predictive system with the three-layer architecture produces interpretable priors for zone optimization and human review.
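The abstract names three ingredients of the Markov transitions (road accessibility costs, seclusion preferences, and corridor bias) and separate day/night parameterizations, but it does not give their functional form. The sketch below assumes a softmax over a linear score purely for illustration. The function `transition_row`, the feature names, the weight names, and all numeric values are hypothetical and should not be read as the authors' formulation.

```python
import math

def transition_row(neighbors, params):
    """Build one sparse row of transition probabilities.

    neighbors: dict mapping a reachable cell id to its features
        'road_cost' (higher is harder to reach), 'seclusion' (0..1),
        and 'on_corridor' (0 or 1).
    params: dict with weights 'w_cost', 'w_seclusion', 'w_corridor'.
    Cells that are not listed as neighbors implicitly get probability 0.
    """
    scores = {
        cell: (-params["w_cost"] * f["road_cost"]
               + params["w_seclusion"] * f["seclusion"]
               + params["w_corridor"] * f["on_corridor"])
        for cell, f in neighbors.items()
    }
    peak = max(scores.values())  # subtract the max for numerical stability
    exp_scores = {cell: math.exp(s - peak) for cell, s in scores.items()}
    total = sum(exp_scores.values())
    return {cell: v / total for cell, v in exp_scores.items()}

# Assumed parameter sets: corridors matter more by day, seclusion by night.
DAY = {"w_cost": 1.0, "w_seclusion": 0.2, "w_corridor": 1.5}
NIGHT = {"w_cost": 1.0, "w_seclusion": 1.5, "w_corridor": 0.3}

# Hypothetical neighbors of a single cell.
NEIGHBORS = {
    "main_road": {"road_cost": 0.5, "seclusion": 0.1, "on_corridor": 1},
    "park": {"road_cost": 1.0, "seclusion": 0.9, "on_corridor": 0},
    "side_street": {"road_cost": 0.8, "seclusion": 0.4, "on_corridor": 0},
}

day_row = transition_row(NEIGHBORS, DAY)
night_row = transition_row(NEIGHBORS, NIGHT)
print("day:", day_row)
print("night:", night_row)
```

With these assumed weights the most likely daytime move is along the corridor cell, while at night the secluded cell becomes the most likely destination. Keeping one small, named weight per factor is one way a reviewer could see why a given cell was ranked highly.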

Subjects: Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as: arXiv:2603.08933 [cs.AI] (or arXiv:2603.08933v1 [cs.AI] for this version)
DOI: https://doi.org/10.48550/arXiv.2603.08933 (arXiv-issued DOI via DataCite)

Submission history

From: Joshua Castillo
[v1] Mon, 9 Mar 2026 21:08:29 UTC (14,788 KB)
