The A-R Behavioral Space: Execution-Level Profiling of Tool-Using Language Model Agents in Organizational Deployment

arXiv cs.AI / 4/15/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • 本研究は、ツールを使ってシステムレベル操作を実行するLLMエージェントについて、「言語による合図」と「実行可能な行動」の構造的関係を、実行層の行動計測として捉える手法を提案しています。
  • A-R空間(Action Rate: A、Refusal Signal: R)に加え、2つの協調度合いを表すD(Divergence)を導入し、4つの規範レジーム(Control/Gray/Dilemma/Malicious)と3つの自律性設定(直接実行/計画/省察)でエージェントを評価します。
  • 既存のような単一の安全スコアで順位付けせず、「実行」と「拒否」が文脈の枠付けや足場(scaffold)の深さに応じてどう再配分されるかを特徴づけます。
  • 結果として、実行と拒否は分離可能な行動次元であり、その同時分布がレジームや自律レベルによって体系的に変化し、特に省察ベースの足場はリスクの高い文脈で拒否寄りにシフトすることが示されます。
  • A-R表現により、横断的な行動プロファイル、足場による遷移、協調のばらつきが可視化でき、組織導入で実行権限とリスク許容が異なる状況に向けたエージェント選定のための「デプロイ志向のレンズ」を提供します。

Abstract

Large language models (LLMs) are increasingly deployed as tool-augmented agents capable of executing system-level operations. While existing benchmarks primarily assess textual alignment or task success, less attention has been paid to the structural relationship between linguistic signaling and executable behavior under varying autonomy scaffolds. This study introduces an execution-layer be-havioral measurement approach based on a two-dimensional A-R space defined by Action Rate (A) and Refusal Signal (R), with Divergence (D) capturing coor-dination between the two. Models are evaluated across four normative regimes (Control, Gray, Dilemma, and Malicious) and three autonomy configurations (di-rect execution, planning, and reflection). Rather than assigning aggregate safety scores, the method characterizes how execution and refusal redistribute across contextual framing and scaffold depth. Empirical results show that execution and refusal constitute separable behavioral dimensions whose joint distribution varies systematically across regimes and autonomy levels. Reflection-based scaffolding often shifts configurations toward higher refusal in risk-laden contexts, but redis-tribution patterns differ structurally across models. The A-R representation makes cross-sectional behavioral profiles, scaffold-induced transitions, and coordination variability directly observable. By foregrounding execution-layer characterization over scalar ranking, this work provides a deployment-oriented lens for analyzing and selecting tool-enabled LLM agents in organizational settings where execution privileges and risk tolerance vary.