TriEx: A Game-based Tri-View Framework for Explaining Internal Reasoning in Multi-Agent LLMs
arXiv cs.CL / 4/23/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- TriEx is a tri-view explainability framework for multi-agent LLM systems in interactive, partially observable environments.
- It produces structured, evidence-anchored artifacts from three perspectives: the agent’s self-reasoning tied to actions, a time-evolving model of opponents’ beliefs, and third-person oracle audits grounded in environment reference signals.
- By converting explanations from free-form narratives into checkable objects, TriEx enables comparisons of explanation faithfulness across time and viewpoints.
- Experiments on imperfect-information strategic games show TriEx can analyze belief dynamics and evaluator reliability, exposing systematic gaps between what agents claim, what they believe, and what they actually do.
- The paper argues that explainability is interaction-dependent and supports multi-view, evidence-grounded evaluation for LLM agents.
Related Articles

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

Why use an AI gateway at all?
Dev.to

OpenAI Just Named It Workspace Agents. We Open-Sourced Our Lark Version Six Months Ago
Dev.to

GPT Image 2 Subject-Lock Editing: A Practical Guide to input_fidelity
Dev.to