From Business Events to Auditable Decisions: Ontology-Governed Graph Simulation for Enterprise AI

arXiv cs.AI / 2026/4/13

📰 ニュースIdeas & Deep AnalysisModels & Research

要点

  • The paper argues that current LLM agent systems can produce fluent yet ungrounded decisions because they operate over an unrestricted knowledge space without simulating how business events alter the relevant scenario context.
  • It introduces LOM-action, an event-driven ontology simulation approach where enterprise ontology–encoded scenario conditions deterministically mutate an isolated graph sandbox to form a scenario-valid simulation graph from which decisions are made.
  • The system uses a dual-mode pipeline (“skill mode” and “reasoning mode”) implementing event → simulation → decision, and it generates fully traceable audit logs for every decision.
  • Evaluation reports 93.82% accuracy and 98.74% tool-chain F1, far outperforming baselines (Doubao-1.8 and DeepSeek-V3.2) that show only 24–36% F1, which the authors interpret as evidence against “illusory accuracy.”
  • The authors conclude that trustworthy enterprise decision intelligence depends more on ontology-governed, event-driven simulation architecture than on simply scaling the underlying model.

Abstract

Existing LLM-based agent systems share a common architectural failure: they answer from the unrestricted knowledge space without first simulating how active business scenarios reshape that space for the event at hand -- producing decisions that are fluent but ungrounded and carrying no audit trail. We present LOM-action, which equips enterprise AI with \emph{event-driven ontology simulation}: business events trigger scenario conditions encoded in the enterprise ontology~(EO), which drive deterministic graph mutations in an isolated sandbox, evolving a working copy of the subgraph into the scenario-valid simulation graph G_{\text{sim}}; all decisions are derived exclusively from this evolved graph. The core pipeline is \emph{event \to simulation \to decision}, realized through a dual-mode architecture -- \emph{skill mode} and \emph{reasoning mode}. Every decision produces a fully traceable audit log. LOM-action achieves 93.82% accuracy and 98.74% tool-chain F1 against frontier baselines Doubao-1.8 and DeepSeek-V3.2, which reach only 24--36% F1 despite 80% accuracy -- exposing the \emph{illusive accuracy} phenomenon. The four-fold F1 advantage confirms that ontology-governed, event-driven simulation, not model scale, is the architectural prerequisite for trustworthy enterprise decision intelligence.