Anthropic says Claude Code's usage drain comes down to peak-hour caps and ballooning contexts

THE DECODER / 4/3/2026


Key Points

  • Anthropic attributes the rapid depletion of Claude Code usage limits to two main factors: peak-hour usage caps and sessions whose context grows very large (“ballooning contexts”).
  • Token consumption rises when a session carries an overly long context or one that is expanded repeatedly.
  • Anthropic also offers practical guidance for reducing token usage, so users can stay within their usage limits.
  • The article frames these findings as a usage-management issue rather than a fundamental model regression, focusing on how users configure and structure their prompts and workflows.
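
To illustrate why a growing context can drain a limit quickly, here is a minimal sketch. It is not Anthropic's actual accounting: it assumes that every request resends the full conversation so far, that each turn adds a fixed number of tokens, and it ignores output tokens and any prompt caching. The function name and parameters are hypothetical.

```python
def cumulative_input_tokens(turns: int, tokens_per_turn: int, base_context: int = 0) -> int:
    """Total input tokens across a session, ASSUMING each request resends the whole history.

    Simplified model for illustration only: no caching, no output tokens,
    and a constant number of new tokens added per turn.
    """
    total = 0
    context = base_context
    for _ in range(turns):
        context += tokens_per_turn  # the new message is appended to the context
        total += context            # the entire context is sent as input again
    return total


if __name__ == "__main__":
    short = cumulative_input_tokens(turns=10, tokens_per_turn=1000)
    long = cumulative_input_tokens(turns=20, tokens_per_turn=1000)
    print(short, long)  # 55000 210000
```

Under these assumptions, doubling the number of turns nearly quadruples the total input tokens, which is why trimming or restarting long sessions can matter more than shortening individual prompts.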

Anthropic explains why Claude Code users have been burning through their limits so fast and shares tips to cut down on token usage.

The article “Anthropic says Claude Code's usage drain comes down to peak-hour caps and ballooning contexts” appeared first on The Decoder.