Question 1

What does o3 cost?

Accepted Answer

List price is $2.00 per million input tokens and $8.00 per million output tokens. However, hidden reasoning tokens are billed as output, so effective cost per task is typically 3–10x higher than those rates suggest. Prompt caching can reduce input costs by 60–80%.

Question 2

How large is the context window?

Accepted Answer

200,000 tokens input, with up to 100,000 tokens of output.

Question 3

What is o3 genuinely best at?

Accepted Answer

Step-by-step reasoning in STEM and software engineering. It scored 95.5% on SWE-Bench Verified and 96.7% on AIME 2024, and users consistently highlight its code quality and ability to handle multi-stage technical problems.

Question 4

Does o3 hallucinate?

Accepted Answer

Yes — users report that o3 sometimes invents false facts and inserts fabricated details with apparent confidence. Outputs in factual or research contexts should be verified. The o3-pro variant shows even higher hallucination rates.

Question 5

How does o3 compare to o3-mini?

Accepted Answer

o3 is the full-size model with higher benchmark scores and a 200K context window. o3-mini is a distilled, lower-cost variant with reduced latency but lower overall capability. Choose o3-mini when speed and cost matter more than peak reasoning depth.

Question 6

Who should skip o3?

Accepted Answer

Users who need fast responses or up-to-the-minute information should look elsewhere. o3's knowledge cuts off around June 2024, and its reasoning latency can be significant — the pro variant sometimes takes up to 15 minutes to respond.

o3

About o3

Best for

Specs & capabilities

Intelligence

Speed

Context window

Max output

Knowledge cutoff

Frequently asked questions

Related models