Question 1

How much does GPT-5 nano cost?

Accepted Answer

Input is $0.05 per million tokens and output is $0.40 per million tokens. Cached input is discounted by 80%, to $0.01 per million tokens, which is a significant saving for workflows that repeatedly reuse long context.
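To make the arithmetic concrete, here is a minimal sketch of a per-request cost estimate. The prices are the figures quoted in this answer and may have changed since; the function name `estimate_cost` and its parameters are hypothetical, chosen for illustration, and are not part of any OpenAI library.

```python
# Prices in USD per million tokens, as quoted in the answer above.
# These are assumptions for the sketch; check current pricing before relying on them.
INPUT_PER_M = 0.05
CACHED_INPUT_PER_M = 0.01
OUTPUT_PER_M = 0.40


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    cached_tokens is the portion of input_tokens that is billed at the
    cached-input rate; the remainder is billed at the normal input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total_per_million = (
        uncached_tokens * INPUT_PER_M
        + cached_tokens * CACHED_INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total_per_million / 1_000_000


if __name__ == "__main__":
    # A 100,000-token prompt with a 2,000-token reply, with and without caching.
    print(f"no cache:   ${estimate_cost(100_000, 2_000):.4f}")
    print(f"90% cached: ${estimate_cost(100_000, 2_000, cached_tokens=90_000):.4f}")
```

Under these prices, a 100,000-token prompt with a 2,000-token reply comes to about $0.0058 uncached and about $0.0022 when 90,000 of the input tokens are cache hits, so caching matters most when the prompt is much larger than the output.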

Question 2

What is the context window?

Accepted Answer

GPT-5 nano has a 400,000-token context window and can generate up to 128,000 output tokens, making it well suited to processing full documents or long conversation histories in a single call.

Question 3

Is GPT-5 nano good for real-time chat applications?

Accepted Answer

No. Its time-to-first-token averages around 84 seconds, which is far above the median of about one second for comparable models. It is better suited to batch and asynchronous workloads than interactive, low-latency use cases.

Question 4

How does it compare to GPT-5 mini?

Accepted Answer

GPT-5 nano is roughly 5x cheaper on blended pricing and faster in raw token throughput, but trails GPT-5 mini by around 10 percentage points on structured tasks and is meaningfully weaker at multi-step reasoning. Choose nano for bulk pipelines where cost dominates; choose mini when accuracy or reasoning depth matters more.

Question 5

What is GPT-5 nano's knowledge cutoff?

Accepted Answer

Training data ends May 30, 2024. Events and developments from mid-2024 onward are outside its knowledge, which is worth considering for tasks that depend on current information.

Question 6

Can I use GPT-5 nano in the ChatGPT web interface?

Accepted Answer

No. Unlike GPT-5 mini, nano is only accessible via the OpenAI API. It is not available in the ChatGPT consumer interface.
