AI Agent Privilege Design: Least Privilege, Sandbox, Human Approval

AI Navigate Original / 4/27/2026

💬 OpinionDeveloper Stack & InfrastructureTools & Practical Usage

共有:

Key Points

Agents execute LLM judgment directly; injection/hallucination cause harm
3 pillars: least privilege, sandbox, human approval for irreversible ops
Audit-log all operations 90+ days for post-hoc tracking
A 2026 incident destroyed prod+backups; keep backups outside agent scope

Why Privilege Design Matters

AI agents (Claude Code, Devin, Operator, Replit Agent, etc.) are mechanisms that let external tools execute the LLM's judgment results directly. Convenient, but prompt injection or LLM hallucination can directly cause "DELETE on the production DB" or "email to all customers."

In 2025 MCP (Model Context Protocol) spread and the number of tools agents can handle exploded. That's exactly why the three pillars of least privilege, sandbox, and human approval are essential.

1. Least Privilege

Give each agent only the minimum scope needed for the task.

Read-only keys: SELECT-only for data-aggregation agents. INSERT/UPDATE/DELETE is a separate agent.
Directory restriction: coding agents can write only under a specific repo. Can't read /etc or ~/.ssh.
API scope: for GitHub Apps, repo:read only. OAuth carved out per user.
Expiry: set tokens short-lived (1-24h), rotate periodically.

In June 2026, Cloudflare introduced "Temporary Cloudflare Accounts for Agents" (GIGAZINE): AI agents performing deploys or similar tasks get a disposable account that is not tied to any human Cloudflare account, and that account is destroyed once the task ends. It is the "short-lived token + minimum scope + rotation" pattern this article describes, but provided as a first-class cloud-platform feature — agent credentials are decoupled from human accounts at the account level itself, pushing minimum-privilege from per-project implementations into the platform layer.

2. Sandbox

Sign up to read the full article

Create a free account to access the full content of our original articles.

Nous Research Updates Hermes Agent With a Blank Slate Mode That Pins Toolsets via platform_toolsets.cli and disabled_toolsets

MarkTechPost

Upload your product docs to BizNode's knowledge base. Your Telegram bot instantly answers customer questions from your own data

Dev.to

Your Selfie Was Fine. 3 Hidden Checks Just Failed You Anyway.

Dev.to

On-Device GenAI with Apple Core AI, Securing LLM Agents, & Mobile RPA

Dev.to

I Packaged My AI Productivity System Into a $1 Kit — Here's Everything In It

Dev.to

AI Agent Privilege Design: Least Privilege, Sandbox, Human Approval

Key Points

Why Privilege Design Matters

1. Least Privilege

2. Sandbox

Sign up to read the full article

Related Articles

Nous Research Updates Hermes Agent With a Blank Slate Mode That Pins Toolsets via platform_toolsets.cli and disabled_toolsets

Upload your product docs to BizNode's knowledge base. Your Telegram bot instantly answers customer questions from your own data

Your Selfie Was Fine. 3 Hidden Checks Just Failed You Anyway.

On-Device GenAI with Apple Core AI, Securing LLM Agents, & Mobile RPA

I Packaged My AI Productivity System Into a $1 Kit — Here's Everything In It

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer