Question 1

What does GPT-4.1 mini cost?

Accepted Answer

Input is $0.40 per million tokens and output is $1.60 per million tokens. With prompt caching, input drops to $0.10 per million tokens. The blended effective rate works out to roughly $0.31 per million tokens.

Question 2

How large is the context window?

Accepted Answer

1 million tokens — the same as full GPT-4.1. Keep in mind that retrieval accuracy drops noticeably at the very high end of that window, so it is best suited to documents well under the theoretical maximum.

Question 3

Does it support images?

Accepted Answer

Yes. It accepts both text and image inputs and performs comparably to GPT-4.1 on vision tasks at a fraction of the price. It does not support audio input or output.

Question 4

How does it compare to full GPT-4.1?

Accepted Answer

It costs about 80% less and responds faster, but trades off some reasoning depth and coding capability. On SWE-Bench Verified it scores 23.6% versus GPT-4.1's higher mark, and complex multi-step reasoning is less reliable.

Question 5

Who should use GPT-4.1 mini instead of a larger model?

Accepted Answer

Teams building latency-sensitive products, high-throughput pipelines, or cost-constrained applications where a mid-tier model is good enough — especially instruction-following, document, and vision tasks.

Question 6

What is its knowledge cutoff?

Accepted Answer

May to June 2024. It will not have awareness of events, model releases, or news after that point.

GPT-4.1 mini

About GPT-4.1 mini

Best for

Specs & capabilities

Intelligence

Speed

Context window

Max output

Knowledge cutoff

Frequently asked questions

Related models