Gemini 3.5 Flash matches or beats the previous flagship at a third to half the cost
Google unveiled Gemini 3.5 Flash at Google I/O 2026. It matches or beats the four-month-old Gemini 3.1 Pro on nearly every major benchmark (Terminal-Bench 2.1: 76.2%, GDPval-AA: 1656 Elo, MCP Atlas: 83.6%, CharXiv: 84.2%) while generating output 4x faster (12x in Antigravity) at one-third to one-half the cost. CEO Sundar Pichai claimed that enterprises running 1 trillion tokens per day could save more than $1B annually.
Four months ago, Gemini 3.1 Pro was the flagship at $2/$12 per 1M tokens (input/output). Flash-tier models were seen as "fast but compromised" and were not trusted for accuracy-sensitive workloads. For large-scale API users, choosing between cost and quality was a constant negotiation that often landed on the pricier tier.
The Flash-vs-Pro trade-off is effectively gone for Gemini. If you are running automation pipelines or retrieval-augmented workflows on Pro, this is the moment to re-evaluate Flash. At trillion-token-per-day scale, the annual savings could exceed $1B by Pichai's estimate. For individuals who open the app a few times a week, the practical difference is negligible.
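For readers who want to sanity-check the savings claim against their own traffic, here is a rough back-of-the-envelope sketch. It makes several assumptions that are not in the announcement: that the $2/$12 figures are Gemini 3.1 Pro's input/output prices per 1M tokens, that Flash costs a flat fraction of the Pro bill (the article gives only the one-third to one-half range, not exact Flash prices), and that a single `output_share` parameter describes the input/output token mix. The function and parameter names are hypothetical.

```python
# Back-of-the-envelope estimate of annual savings from moving Pro traffic to Flash.
# Assumptions (not from the announcement): $2 is the input price and $12 the
# output price per 1M tokens, and Flash costs a flat fraction of the Pro bill.

PRO_INPUT_USD_PER_M = 2.0    # assumed Pro input price per 1M tokens
PRO_OUTPUT_USD_PER_M = 12.0  # assumed Pro output price per 1M tokens
DAYS_PER_YEAR = 365


def annual_savings_usd(tokens_per_day: float,
                       output_share: float,
                       flash_cost_ratio: float) -> float:
    """Estimate yearly savings in USD.

    tokens_per_day:   total tokens (input + output) processed per day
    output_share:     fraction of those tokens that are output (0.0 to 1.0)
    flash_cost_ratio: Flash cost as a fraction of Pro cost (e.g. 1/3 or 1/2)
    """
    millions_per_day = tokens_per_day / 1_000_000
    # Blended Pro price per 1M tokens for the given input/output mix.
    blended_price = ((1 - output_share) * PRO_INPUT_USD_PER_M
                     + output_share * PRO_OUTPUT_USD_PER_M)
    pro_annual_cost = millions_per_day * blended_price * DAYS_PER_YEAR
    return pro_annual_cost * (1 - flash_cost_ratio)


if __name__ == "__main__":
    tokens_per_day = 1e12  # the trillion-token-per-day scale cited by Pichai
    for output_share in (0.0, 0.2, 0.5):
        for ratio in (1 / 3, 1 / 2):
            saved = annual_savings_usd(tokens_per_day, output_share, ratio)
            print(f"output share {output_share:.0%}, Flash at {ratio:.2f}x Pro: "
                  f"${saved / 1e9:.2f}B saved per year")
```

Under these assumptions the result depends heavily on the token mix: an input-only workload with Flash at half the Pro price saves roughly $365M a year, so a figure above $1B implies a substantial share of output tokens or a discount near the one-third end of the range.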