Question 1

How much does o4-mini cost?

Accepted Answer

Input is $1.10 per million tokens and output is $4.40 per million tokens. A Batch API option cuts both prices by 50%.
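As a rough illustration of how these rates turn into a bill, the sketch below estimates the cost of a request from its token counts. The function name `estimate_cost` and the constant names are made up for this example, and it assumes the 50% Batch API discount applies equally to input and output tokens, as the answer states.

```python
# Rates quoted in the answer above, in USD per million tokens.
INPUT_PRICE_PER_M = 1.10
OUTPUT_PRICE_PER_M = 4.40
BATCH_DISCOUNT = 0.50  # Batch API cuts both prices by 50%


def estimate_cost(input_tokens: int, output_tokens: int, batch: bool = False) -> float:
    """Return the estimated cost in USD (hypothetical helper, not an official API)."""
    cost = (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000
    if batch:
        cost *= 1 - BATCH_DISCOUNT
    return round(cost, 6)


if __name__ == "__main__":
    # One million tokens in and one million tokens out, standard vs. batch.
    print(estimate_cost(1_000_000, 1_000_000))
    print(estimate_cost(1_000_000, 1_000_000, batch=True))
```

At these rates, one million input tokens plus one million output tokens comes to $5.50 at standard pricing and $2.75 through the Batch API.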

Question 2

What is the context window?

Accepted Answer

The context window is 200,000 tokens, with a maximum of 100,000 output tokens per response.
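A quick pre-flight check against these limits might look like the sketch below. The helper `fits_limits` is hypothetical. It makes the conservative assumption that the prompt and the requested output must together fit within 200,000 tokens, and it expects token counts that you have already measured with a tokenizer.

```python
# Limits quoted in the answer above.
CONTEXT_WINDOW = 200_000
MAX_OUTPUT_TOKENS = 100_000


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Check a request against the quoted limits (hypothetical helper).

    Assumption: prompt and output share the same 200,000-token budget.
    """
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    print(fits_limits(150_000, 40_000))   # within both limits
    print(fits_limits(150_000, 60_000))   # exceeds the shared budget
    print(fits_limits(10_000, 120_000))   # exceeds the output cap
```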

Question 3

How does it compare to o3?

Accepted Answer

o4-mini is about 10x cheaper than o3 and beats it on AIME math benchmarks. It scores below o3 on general intelligence indices, however, and trails it on complex visual reasoning tasks such as scientific figure interpretation.

Question 4

Is o4-mini still available?

Accepted Answer

It was retired from ChatGPT on February 13, 2026. API access continues until October 23, 2026. For new projects, OpenAI recommends migrating to GPT-5.4 mini or o3-mini.

Question 5

What does o4-mini struggle with?

Accepted Answer

General reasoning breadth, slow first responses (latency over 22 seconds), and verbose outputs that can be hard to skim. It also cannot be fine-tuned.

Question 6

Who should choose o4-mini?

Accepted Answer

Teams running math, coding, or document-processing workloads at scale who need strong domain performance without o3's price tag, provided they can tolerate the model's sunset timeline and don't need sub-second responsiveness.
