AutoSAM: an Agentic Framework for Automating Input File Generation for the SAM Code with Multi-Modal Retrieval-Augmented Generation

arXiv cs.AI / 3/27/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisTools & Practical UsageModels & Research

共有:

Key Points

AutoSAMは、System Analysis Module（SAM）などの熱水力コード向けの入力ファイル作成を自動化するエージェント型フレームワークであり、設計情報をソルバー構文へ変換する作業負荷を減らすことを目的としています。
LLMエージェントに加えて、SAMのユーザーガイド／理論マニュアルを対象にしたRAGと、PDF・画像・スプレッドシート・テキスト等を解析する専用ツールを組み合わせ、未整理の工学ドキュメントからシミュレーション関連パラメータを抽出します。
抽出したパラメータは人が監査できる中間表現（intermediate representation）に整理され、その後、検証された形で実行可能なSAM入力デッキとして生成されます。
マルチモーダル検索では、科学テキスト抽出、図の視覚的解釈、意味埋め込み、質問応答を統合し、ケーススタディ4件（単純系から高度炉心・実験ループまで）で整合性のあるモデル生成を示しています。
評価では、構造化入力の100%活用やPDFテキスト抽出の約88%、図形幾何抽出の100%完全性に加え、不足データの明示と仮定値のラベリングも行える点が示されています。

Abstract

In the design and safety analysis of advanced reactor systems, constructing input files for system-level thermal-hydraulics codes such as the System Analysis Module (SAM) remains a labor-intensive task. Analysts must extract and reconcile design data from heterogeneous engineering documents and manually translate it into solver-specific syntax. In this paper, we present AutoSAM, an agentic framework that automates SAM input file generation. The framework combines a large language model agent with retrieval-augmented generation over the solver's user guide and theory manual, together with specialized tools for analyzing PDFs, images, spreadsheets, and text files. AutoSAM ingests unstructured engineering documents, including system diagrams, design reports, and data tables, extracts simulation-relevant parameters into a human-auditable intermediate representation, and synthesizes validated, solver-compatible input decks. Its multimodal retrieval pipeline integrates scientific text extraction, vision-based figure interpretation, semantic embedding, and query answering. We evaluate AutoSAM on four case studies of increasing complexity: a single-pipe steady-state model, a solid-fuel channel with temperature reactivity feedback, the Advanced Burner Test Reactor core, and the Molten Salt Reactor Experiment primary loop. Across all cases, the agent produces runnable SAM models consistent with expected thermal-hydraulic behavior while explicitly identifying missing data and labeling assumed values. The framework achieves 100% utilization of structured inputs, about 88% extraction from PDF text, and 100% completeness in vision-based geometric extraction. These results demonstrate a practical path toward prompt-driven reactor modeling, in which analysts provide system descriptions and supporting documentation while the agent translates them into transparent, and executable, SAM simulations.