GPT-5.4 mini and GPT-5.4 nano, which can describe 76,000 photos for $52

Simon Willison's Blog / 3/18/2026


Key Points

OpenAI announced GPT-5.4 mini and GPT-5.4 nano, which join GPT-5.4 two weeks after its launch.
According to OpenAI's self-reported benchmarks, GPT-5.4 nano at maximum reasoning effort outperforms the previous GPT-5 mini, and the new mini is 2x faster than the previous mini.
The post includes a per-million-token pricing table and notes that gpt-5.4-nano is cheaper than Google's Gemini 3.1 Flash-Lite.
A practical example has GPT-5.4 nano describe a real photo for 0.069 cents, which works out to around $52.44 for a 76,000-photo collection.

Simon Willison’s Weblog



17th March 2026

OpenAI today: Introducing GPT‑5.4 mini and nano. These models join GPT-5.4, which was released two weeks ago.

OpenAI’s self-reported benchmarks show the new 5.4-nano out-performing their previous GPT-5 mini model when run at maximum reasoning effort. The new mini is also 2x faster than the previous mini.

Here’s how the pricing looks—all prices are per million tokens. gpt-5.4-nano is notably even cheaper than Google’s Gemini 3.1 Flash-Lite:

Model	Input	Cached input	Output
gpt-5.4	$2.50	$0.25	$15.00
gpt-5.4-mini	$0.75	$0.075	$4.50
gpt-5.4-nano	$0.20	$0.02	$1.25
Other models for comparison
Claude Opus 4.6	$5.00	-	$25.00
Claude Sonnet 4.6	$3.00	-	$15.00
Gemini 3.1 Pro	$2.00	-	$12.00
Claude Haiku 4.5	$1.00	-	$5.00
Gemini 3.1 Flash-Lite	$0.25	-	$1.50
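
To put the Flash-Lite comparison in numbers, here is a small sketch that computes how much cheaper gpt-5.4-nano is per million tokens, using only the prices in the table above. The dictionary layout and the `percent_cheaper` helper are illustrative choices, not part of any API.

```python
# Prices in USD per million tokens, copied from the table above.
PRICES = {
    "gpt-5.4-nano": {"input": 0.20, "output": 1.25},
    "Gemini 3.1 Flash-Lite": {"input": 0.25, "output": 1.50},
}


def percent_cheaper(model: str, baseline: str, kind: str) -> float:
    """Return how many percent cheaper `model` is than `baseline` for `kind` tokens."""
    return (1 - PRICES[model][kind] / PRICES[baseline][kind]) * 100


for kind in ("input", "output"):
    saving = percent_cheaper("gpt-5.4-nano", "Gemini 3.1 Flash-Lite", kind)
    print(f"{kind}: {saving:.1f}% cheaper")
```

On these list prices, nano comes out 20% cheaper on input tokens and roughly 17% cheaper on output tokens.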

I used GPT-5.4 nano to generate a description of this photo I took at the John M. Mossman Lock Collection:


llm -m gpt-5.4-nano -a IMG_2324.jpeg 'describe image'

Here’s the output:

The image shows the interior of a museum gallery with a long display wall. White-painted brick walls are covered with many framed portraits arranged in neat rows. Below the portraits, there are multiple glass display cases with dark wooden frames and glass tops/fronts, containing various old historical objects and equipment. The room has a polished wooden floor, hanging ceiling light fixtures/cords, and a few visible pipes near the top of the wall. In the foreground, glass cases run along the length of the room, reflecting items from other sections of the gallery.

That took 2,751 input tokens and 112 output tokens, at a cost of 0.069 cents (less than a tenth of a cent). That means describing every single photo in my 76,000 photo collection would cost around $52.44.
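
The arithmetic behind those figures can be reproduced with a short sketch. It assumes the gpt-5.4-nano prices from the pricing table and that every photo in the collection uses about the same number of tokens as this one; the function and variable names are made up for illustration.

```python
# gpt-5.4-nano prices in USD per million tokens, from the pricing table.
INPUT_PRICE = 0.20
OUTPUT_PRICE = 1.25


def prompt_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Cost in dollars of one prompt at the gpt-5.4-nano rates above."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000


per_photo = prompt_cost_usd(2_751, 112)

# Round to three decimal places of a cent, matching the 0.069 cents figure.
cents = round(per_photo * 100, 3)

# Assumption: all 76,000 photos cost about the same as this one.
collection = 76_000 * cents / 100

print(f"{cents} cents per photo")
print(f"${collection:.2f} for 76,000 photos")
```

Multiplying the unrounded per-photo cost by 76,000 gives about $52.46 instead, so the $52.44 figure appears to be based on the rounded 0.069 cents.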

I released llm 0.29 with support for the new models.

Then I had OpenAI Codex loop through all five reasoning effort levels and all three models and produce this combined SVG grid of pelicans riding bicycles (generation transcripts here). I do like the gpt-5.4 xhigh one the best: it has a good bicycle (with nice spokes) and the pelican has a fish in its beak!

Posted 17th March 2026 at 7:39 pm
