Question 1

What does GPT OSS 20b cost?

Accepted Answer

OpenAI does not publish official API pricing for this model. Third-party providers charge roughly $0.05 per million input tokens and $0.14–$0.20 per million output tokens. The Apache 2.0 license also allows free local deployment with no per-token cost.
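As a rough illustration of the arithmetic, the sketch below turns token counts into an estimated bill. The rates are the approximate third-party figures quoted in this answer, not official pricing, and the names `estimate_cost_usd` and `estimate_cost_range_usd` are hypothetical helpers introduced only for this example.

```python
# Approximate third-party rates quoted in this answer (USD per million tokens).
# These are illustrative assumptions, not official pricing; check your provider.
INPUT_RATE = 0.05
OUTPUT_RATE_LOW = 0.14
OUTPUT_RATE_HIGH = 0.20


def estimate_cost_usd(input_tokens: int, output_tokens: int, output_rate: float) -> float:
    """Return the estimated cost in USD for a given number of tokens."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Rates are per million tokens, so divide the weighted sum by 1,000,000.
    return (input_tokens * INPUT_RATE + output_tokens * output_rate) / 1_000_000


def estimate_cost_range_usd(input_tokens: int, output_tokens: int) -> tuple[float, float]:
    """Return (low, high) estimates using both ends of the quoted output-rate range."""
    low = estimate_cost_usd(input_tokens, output_tokens, OUTPUT_RATE_LOW)
    high = estimate_cost_usd(input_tokens, output_tokens, OUTPUT_RATE_HIGH)
    return low, high


if __name__ == "__main__":
    low, high = estimate_cost_range_usd(10_000_000, 2_000_000)
    print(f"Estimated cost: ${low:.2f} to ${high:.2f}")
```

At these rates, a workload of 10 million input tokens and 2 million output tokens comes to roughly $0.78 to $0.90. Local deployment has no per-token charge, though hardware and power costs are not modeled here.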

Question 2

What is the context window?

Accepted Answer

131,072 tokens (128k). Long-context reasoning is a documented weak point, however: the model scores only 14% on the AA-LCR benchmark, so it is poorly suited to complex multi-step reasoning spread across large documents.
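A simple budget check can catch oversized requests before they are sent. This is a minimal sketch that assumes the 131,072-token limit covers the prompt and the generated output combined, which is worth confirming with your provider. The names `fits_context` and `remaining_output_budget` are hypothetical, and the token counts should come from whatever tokenizer your deployment actually uses.

```python
# Context window stated in this answer: 128 * 1024 tokens.
CONTEXT_WINDOW = 131_072


def remaining_output_budget(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens are left for generation, never below zero."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_window - prompt_tokens, 0)


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 context_window: int = CONTEXT_WINDOW) -> bool:
    """Return True if the prompt plus the requested output fits in the window.

    Assumption: input and output tokens share one window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= context_window


if __name__ == "__main__":
    print(fits_context(100_000, 20_000))
    print(remaining_output_budget(100_000))
```

Fitting inside the window only means the request is accepted. Given the long-context weakness noted above, answer quality may still degrade well before the limit.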

Question 3

Does it support images or other media?

Accepted Answer

No. GPT OSS 20b is text-only. There is no vision, audio, or file-upload support.

Question 4

How does it compare to o3-mini?

Accepted Answer

On mathematical benchmarks and TauBench tool use, it matches or approaches o3-mini. In complex multi-step agentic workflows, especially extended conversations with many available tools, o3-mini remains more reliable. GPT OSS 20b's advantages are that it can run locally and costs far less.

Question 5

What is the knowledge cutoff?

Accepted Answer

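May 2024. The model has no built-in knowledge of events after that date, so it cannot reliably answer questions about them unless the relevant information is supplied in the prompt.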

Question 6

Can I fine-tune or modify the weights?

Accepted Answer

Yes. The model is released under Apache 2.0, which permits fine-tuning, modification, and commercial redistribution without copyleft restrictions.
