Opus 4.8 is live: better reasoning, same API price
Claude Opus 4.8 generally available — improved agentic reasoning and "faithfulness" (more measured admissions of uncertainty), unchanged API rates, fast mode is now 3x cheaper, and the model is live on AWS Bedrock on day one
Until yesterday Opus 4.7 was the top of the Claude stack, and fast mode cost the same as the standard rate. Long agentic runs overnight were genuinely expensive — many teams defaulted to Sonnet mid-task to keep costs down. Opus 4.8 adds a focus on faithfulness: the model is now more likely to admit uncertainty rather than guessing. API and Bedrock access went live today.
If you run agentic pipelines at volume, the fast-mode price cut is the headline — one-third the cost means overnight batch jobs just got much cheaper. The faithfulness improvement matters most in high-stakes work where a confident wrong answer is worse than an uncertain right one. For light chat use, the difference from 4.7 will be hard to notice.