Question 1

What does GPT-5.4 mini cost?

Accepted Answer

GPT-5.4 mini costs $0.75 per million input tokens and $4.50 per million output tokens. Cached input drops to $0.075 per million, a 90% discount, which makes workloads that resend the same context significantly cheaper.
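To make the arithmetic concrete, here is a minimal sketch of a per-request cost estimate using the prices quoted in the answer. The function name and the idea of passing the cached portion as a token count are illustrative assumptions, not part of any official SDK; actual billing may round or meter differently.

```python
# Prices in USD per million tokens, as quoted in the answer above.
INPUT_PER_M = 0.75
CACHED_INPUT_PER_M = 0.075
OUTPUT_PER_M = 4.50


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the cost of one request in USD.

    cached_tokens is the part of input_tokens billed at the cached rate
    (a hypothetical parameter for this sketch).
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * INPUT_PER_M
        + cached_tokens * CACHED_INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / 1_000_000


# 100k input tokens and 2k output tokens, with and without 80k tokens cached.
print(f"{estimate_cost(100_000, 2_000):.4f}")          # 0.0840
print(f"{estimate_cost(100_000, 2_000, 80_000):.4f}")  # 0.0300
```

In this example, caching 80% of the input cuts the request cost from about 8.4 cents to about 3 cents, because output tokens and the uncached remainder are still billed at full price.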

Question 2

How large is the context window?

Accepted Answer

The context window holds 400,000 input tokens, with up to 128,000 tokens of output. That is enough for long documents, large codebases, or extended multi-turn sessions.
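A simple pre-flight check against these limits might look like the sketch below. It assumes the input and output figures are independent caps, as the answer's wording suggests, and that token counts have already been obtained from a tokenizer; the function name is hypothetical.

```python
# Limits as quoted in the answer above.
MAX_INPUT_TOKENS = 400_000
MAX_OUTPUT_TOKENS = 128_000


def fits_limits(input_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within both quoted limits.

    Assumption: the input and output limits are checked independently.
    """
    return (
        0 <= input_tokens <= MAX_INPUT_TOKENS
        and 0 <= requested_output_tokens <= MAX_OUTPUT_TOKENS
    )


print(fits_limits(350_000, 8_000))   # True
print(fits_limits(450_000, 8_000))   # False: input exceeds 400,000
print(fits_limits(10_000, 200_000))  # False: output exceeds 128,000
```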

Question 3

How does it compare to GPT-5.4?

Accepted Answer

GPT-5.4 mini scores 54.4% on SWE-Bench Pro versus GPT-5.4's 57.7%, a gap of 3.3 percentage points, while costing roughly one-sixth as much. The main difference is maximum reasoning depth: GPT-5.4 mini caps at reasoning_effort 'high', while GPT-5.4 supports 'pro'.

Question 4

What is it genuinely not good at?

Accepted Answer

Deep reasoning tasks with little margin for error, complex spatial reasoning (such as interpreting 3D shapes from 2D patterns), and strict instruction-following under high load, where it can apply prompt rules inconsistently or ignore them.

Question 5

Does it support images and tool use?

Accepted Answer

Yes. It accepts text and image inputs, and supports tool use, function calling, web search, file search, computer use, and extended thinking up to reasoning_effort 'high'.

Question 6

Who should choose GPT-5.4 mini over GPT-5 mini?

Accepted Answer

Anyone currently using GPT-5 mini. The upgrade brings meaningfully better quality (the Arena Elo gap is 61 points) at similar or lower cost, with noticeably faster output of around 180 tokens per second.
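To put the 61-point gap in perspective, the standard Elo expected-score formula converts a rating difference into an expected head-to-head preference rate. The sketch below assumes the Arena ratings follow the conventional 400-point Elo scale; the leaderboard's exact fitting method may differ, so treat the result as a rough guide.

```python
def expected_win_rate(rating_gap: float, scale: float = 400.0) -> float:
    """Expected score of the higher-rated model under the standard Elo formula.

    Assumption: ratings use the conventional 400-point logistic scale.
    """
    return 1.0 / (1.0 + 10.0 ** (-rating_gap / scale))


print(f"{expected_win_rate(61):.3f}")  # 0.587
```

Under that assumption, a 61-point lead means the newer model would be preferred in roughly 59% of direct comparisons.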
