I Accidentally Spent $400 on GPT-4o in One Month. Here's How to Never Do That.

Dev.to / 6/4/2026

💬 Opinion, Developer Stack & Infrastructure, Signals & Early Trends, Ideas & Deep Analysis, Tools & Practical Usage

Key Points

The author describes a GPT-4o-powered support chatbot project whose monthly API bill ($400) greatly exceeded expectations because a long system prompt (600 tokens) was included in every API call.
They estimate that the repeated system prompt added roughly 4.8M extra input tokens per day, driving hundreds of dollars of unexpected monthly cost.
To prevent similar surprises, the article introduces a free LLM API Cost Calculator that supports 18 models, multiple currencies, live token counting, and model-by-model cost comparisons for a given workload.
The piece explains a key pricing mental model: output tokens typically cost 3–5× more than input tokens, so the total bill depends on the workload's mix of input and output tokens (e.g., sentiment classification is often input-dominated, while support chatbots are output-dominated).
It promises a walkthrough of how to estimate costs properly and optimize either system prompt size or response length (max_tokens) depending on which token type dominates.
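
The arithmetic behind these points can be sketched in a few lines. This is a minimal illustration, not the article's calculator. The per-token prices are assumptions (roughly in line with published GPT-4o rates at the time of writing, but check the provider's pricing page). The 8,000 calls per day figure is simply what 4.8M tokens per day divided by a 600-token prompt implies. The `Workload` fields and the two example workloads are hypothetical.

```python
from dataclasses import dataclass

# ASSUMED prices in USD per 1M tokens (not taken from the article);
# check your provider's pricing page before relying on them.
INPUT_USD_PER_M = 2.50
OUTPUT_USD_PER_M = 10.00


@dataclass
class Workload:
    calls_per_day: int
    system_prompt_tokens: int  # resent on every call
    user_tokens: int           # average user message size
    output_tokens: int         # average response size


def monthly_costs(w: Workload, days: int = 30) -> dict:
    """Split the estimated monthly bill into system prompt, user input, and output."""
    calls = w.calls_per_day * days
    system = calls * w.system_prompt_tokens / 1_000_000 * INPUT_USD_PER_M
    user = calls * w.user_tokens / 1_000_000 * INPUT_USD_PER_M
    output = calls * w.output_tokens / 1_000_000 * OUTPUT_USD_PER_M
    return {
        "system_prompt": system,
        "user_input": user,
        "output": output,
        "total": system + user + output,
    }


def dominant_side(w: Workload) -> str:
    """Return which token type accounts for more of the bill."""
    c = monthly_costs(w)
    input_cost = c["system_prompt"] + c["user_input"]
    return "input" if input_cost >= c["output"] else "output"


# 600-token system prompt on 8,000 calls/day = 4.8M extra input tokens/day.
prompt_only = Workload(calls_per_day=8000, system_prompt_tokens=600,
                       user_tokens=0, output_tokens=0)
print(f"{monthly_costs(prompt_only)['system_prompt']:.2f}")  # 360.00

# Hypothetical workloads to show the two regimes.
classifier = Workload(8000, 600, 200, 5)    # short label as output
chatbot = Workload(8000, 600, 100, 500)     # long replies
print(dominant_side(classifier), dominant_side(chatbot))
```

Under these assumed prices the repeated system prompt alone comes to about $360 per month. For an input-dominated workload, trimming the system prompt is the lever with the larger effect; for an output-dominated one, capping response length with `max_tokens` matters more.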
