Question 1

What does GPT-5.4 cost?

Accepted Answer

Input costs $2.50 per million tokens and output costs $15.00 per million tokens. Cached input drops to $0.25 per million. Prompts over 272K tokens trigger 2x input and 1.5x output charges for the entire session, and the model tends to generate more output tokens than average, so real-world costs can exceed what the per-token rates imply.
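
To make the arithmetic concrete, here is a minimal Python sketch of a cost estimate built only from the rates quoted in this answer. The function name `estimate_cost` and its parameters are hypothetical. It also assumes the 2x long-prompt multiplier applies to cached input as well as uncached input, which the answer does not confirm, so treat the result as a rough estimate rather than a billing calculation.

```python
# Rates quoted in the answer above, in USD per million tokens.
INPUT_PER_M = 2.50
CACHED_INPUT_PER_M = 0.25
OUTPUT_PER_M = 15.00

# Prompts above this size are billed at a premium (per the answer above).
LONG_PROMPT_THRESHOLD = 272_000
LONG_INPUT_MULT = 2.0
LONG_OUTPUT_MULT = 1.5


def estimate_cost(input_tokens: int, output_tokens: int,
                  cached_input_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    input_tokens is the total prompt size, including any cached tokens.
    ASSUMPTION: the long-prompt multiplier is applied to cached input too.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")

    is_long = input_tokens > LONG_PROMPT_THRESHOLD
    in_mult = LONG_INPUT_MULT if is_long else 1.0
    out_mult = LONG_OUTPUT_MULT if is_long else 1.0

    uncached = input_tokens - cached_input_tokens
    total = (
        uncached * INPUT_PER_M * in_mult
        + cached_input_tokens * CACHED_INPUT_PER_M * in_mult
        + output_tokens * OUTPUT_PER_M * out_mult
    )
    return total / 1_000_000


if __name__ == "__main__":
    # A 100K-token prompt with 10K output tokens, below the threshold.
    print(f"{estimate_cost(100_000, 10_000):.3f}")
    # A 300K-token prompt with 10K output tokens, above the threshold.
    print(f"{estimate_cost(300_000, 10_000):.3f}")
```

Under these assumptions, crossing the 272K threshold more than doubles the cost per input token, which is why it can pay to trim or split long prompts when the task allows.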

Question 2

How large is the context window?

Accepted Answer

The maximum context window is 1.05 million tokens, with up to 128K output tokens. However, the default context window is 272K tokens; you must explicitly configure the model to use extended context. Sessions exceeding 272K are billed at a premium rate.
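
As a rough illustration of these limits, the following Python sketch classifies a prompt by size. The function name `context_tier` and the tier labels are hypothetical, and "K" is read as 1,000 tokens, which is an assumption. The sketch looks only at prompt tokens and does not model how the extended context is actually enabled, since the answer does not describe that setting.

```python
# Limits quoted in the answer above (assuming "K" means 1,000 tokens).
DEFAULT_CONTEXT = 272_000
MAX_CONTEXT = 1_050_000


def context_tier(prompt_tokens: int) -> str:
    """Return which context tier a prompt of this size falls into.

    "default"   : fits the default window, standard rates
    "extended"  : needs extended context and is billed at a premium
    "too_large" : exceeds the maximum context window
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    if prompt_tokens <= DEFAULT_CONTEXT:
        return "default"
    if prompt_tokens <= MAX_CONTEXT:
        return "extended"
    return "too_large"


if __name__ == "__main__":
    for size in (50_000, 400_000, 2_000_000):
        print(size, context_tier(size))
```

A check like this can run before a request is sent, so that a prompt needing extended context is flagged before it incurs the premium rate.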

Question 3

What is GPT-5.4 best at?

Accepted Answer

Coding, structured professional work (financial modeling, legal analysis, spreadsheet tasks), multi-step agentic workflows, and long-context document analysis. It scored 83% on professional comparison benchmarks and 87.3% on investment banking modeling tasks.

Question 4

What are its main weaknesses?

Accepted Answer

It takes prompts too literally and can miss the underlying intent of ambiguous requests, an area where Claude Opus tends to do better. Its writing style is frequently described as verbose and mechanical, with unnecessary follow-up suggestions. It can also stop short in very long agentic workflows.

Question 5

How does it compare to GPT-5.2?

Accepted Answer

GPT-5.4 is a meaningful upgrade: professional benchmark scores improved from 70.9% to 83%, investment banking modeling from 68.4% to 87.3%, and the model gained native computer-use capabilities entirely absent in GPT-5.2. It is more expensive, however.

Question 6

Has GPT-5.4 been superseded?

Accepted Answer

GPT-5.5 was released in June 2026 with better token efficiency, improved long-context reasoning, and stronger agentic autonomy. GPT-5.4 remains available and is a practical option for cost-conscious users who do not need those specific improvements.
