LLM agents can trigger real actions now. But what actually stops them from executing?

Reddit r/artificial / 4/1/2026

💬 Opinion · Developer Stack & Infrastructure · Ideas & Deep Analysis · Tools & Practical Usage

Key Points

  • The article argues that LLM agents with tool calling can “propose” actions without a strict mechanism to enforce whether those actions should actually execute, creating risk when tools have real side effects like APIs, infrastructure, or payments.
  • It illustrates the failure mode where the same model and tool can yield different outcomes (allow vs deny) only because a policy blocks a later call before execution, emphasizing that enforcement must occur pre-tool.
  • It explains that many agent setups still effectively let the model indirectly control execution in a pipeline shaped like model → tool → execution, even if validation, retries, and guardrails exist.
  • The proposed alternative is to restructure the flow into proposal → (policy + state) → ALLOW/DENY → execution, using authorization gating so that denied actions never reach the tool at all.
  • A demo repository is linked to show the approach, and the piece ends by prompting readers to share how they gate or monitor agent tool execution today.

We ran into a simple but important issue while building agents with tool calling:

the model can propose actions
but nothing actually enforces whether those actions should execute.

That works fine… until the agent controls real side effects:

  • APIs
  • infrastructure
  • payments
  • workflows

Example

Same model, same tool, same input:

#1 provision_gpu -> ALLOW
#2 provision_gpu -> ALLOW
#3 provision_gpu -> DENY

The key detail:

the third call is blocked before execution

No retry
No partial execution
No side effect

The underlying problem

Most setups look like this:

model -> tool -> execution 

Even with:

  • validation
  • retries
  • guardrails

…the model still indirectly controls when execution happens.
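As a minimal sketch of that shape (the tool and dispatcher names here are hypothetical, not from the linked repo), the model's emitted tool call is looked up and run directly, so the model alone decides whether the side effect happens:

```python
# Sketch of the common "model -> tool -> execution" pipeline.
# All names (provision_gpu, dispatch) are hypothetical illustrations.

def provision_gpu(region: str) -> str:
    # The real side effect (API call, billing, etc.) would happen here.
    return f"provisioned GPU in {region}"

TOOLS = {"provision_gpu": provision_gpu}

def dispatch(tool_call: dict) -> str:
    # Whatever tool call the model emits is looked up and executed.
    # Validation or retries may wrap this, but nothing independent of
    # the model sits between the call and the side effect.
    fn = TOOLS[tool_call["name"]]
    return fn(**tool_call["arguments"])

# A tool call as the model might emit it:
print(dispatch({"name": "provision_gpu",
                "arguments": {"region": "us-east-1"}}))
```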

What changed

We tried a different approach:

proposal -> (policy + state) -> ALLOW / DENY -> execution 

Key constraint:

no authorization -> no execution path 

So a denied action doesn’t just “fail”; it never reaches the tool at all.
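A minimal sketch of that flow (all names hypothetical; the linked repo shows the actual implementation): the policy layer holds state, here a simple quota of two provisions, and returns ALLOW or DENY before the tool is ever looked up, which is how three identical proposals can yield ALLOW, ALLOW, DENY:

```python
# Sketch of proposal -> (policy + state) -> ALLOW / DENY -> execution.
# Hypothetical names; the policy here is a simple quota of 2.

def provision_gpu(region: str) -> str:
    return f"provisioned GPU in {region}"

TOOLS = {"provision_gpu": provision_gpu}

class Policy:
    def __init__(self, quota: int = 2):
        self.quota = quota
        self.used = 0  # state lives in the gate, not in the model

    def decide(self, proposal: dict) -> str:
        if proposal["name"] == "provision_gpu" and self.used >= self.quota:
            return "DENY"
        self.used += 1
        return "ALLOW"

def execute(policy: Policy, proposal: dict) -> str:
    if policy.decide(proposal) == "DENY":
        # Denied proposals never reach the tool: no retry,
        # no partial execution, no side effect.
        return "DENY"
    return TOOLS[proposal["name"]](**proposal["arguments"])

policy = Policy(quota=2)
call = {"name": "provision_gpu",
        "arguments": {"region": "us-east-1"}}
for i in range(1, 4):
    print(f"#{i}", execute(policy, call))
```

Same model output each time; it is the gate's state, not the model, that flips the third call to DENY.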

Demo

https://github.com/AngeYobo/oxdeai/tree/main/examples/openai-tools

Why this feels important

Once agents move from “thinking” to “acting”,
the risk is no longer the output; it’s the side effect.

And right now, most systems don’t have a clear boundary there.

Question

How are you handling this?

  • Do you gate execution before tool calls?
  • Or rely on retries / monitoring after the fact?
submitted by /u/docybo