Question 1

What does it cost?

Accepted Answer

Pricing ranges from $1.25 to $2.00 per 1M input tokens and from $2.50 to $6.00 per 1M output tokens, depending on tier. Cached input is $0.20 per 1M tokens. Check xAI's current documentation, as pricing has shifted since the March 2026 launch.
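To see what these rates mean for a concrete workload, here is a rough Python sketch that turns token counts into a low/high cost estimate. The prices are the ones quoted in this answer. The function name and the billing model are assumptions for illustration: it treats cached tokens as a subset of input tokens billed at the cached rate, and it pairs the lowest input price with the lowest output price (and highest with highest). Actual tier rules may differ, so treat the result as a ballpark.

```python
# Per-1M-token prices in USD, copied from the answer above.
INPUT_PRICE_RANGE = (1.25, 2.00)    # (lowest tier, highest tier)
OUTPUT_PRICE_RANGE = (2.50, 6.00)   # (lowest tier, highest tier)
CACHED_INPUT_PRICE = 0.20

TOKENS_PER_UNIT = 1_000_000


def estimate_cost_range(input_tokens, output_tokens, cached_input_tokens=0):
    """Return a (low, high) USD estimate for one workload.

    Assumption (not confirmed by the source): cached tokens are part of
    input_tokens and are billed at the cached rate instead of the input rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")

    uncached = input_tokens - cached_input_tokens
    cached_cost = cached_input_tokens * CACHED_INPUT_PRICE / TOKENS_PER_UNIT

    low = (uncached * INPUT_PRICE_RANGE[0]
           + output_tokens * OUTPUT_PRICE_RANGE[0]) / TOKENS_PER_UNIT + cached_cost
    high = (uncached * INPUT_PRICE_RANGE[1]
            + output_tokens * OUTPUT_PRICE_RANGE[1]) / TOKENS_PER_UNIT + cached_cost
    return low, high


if __name__ == "__main__":
    # 800k input tokens (300k of them cached) and 200k output tokens.
    low, high = estimate_cost_range(800_000, 200_000, cached_input_tokens=300_000)
    print(f"Estimated cost: ${low:.3f} to ${high:.3f}")
```

Under these assumptions, output tokens dominate the bill at the top tier, so capping response length is usually the first lever to pull.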

Question 2

How large is the context window?

Accepted Answer

The official xAI spec for the non-reasoning variant lists 1 million tokens. Some sources cite 2M for the full Grok 4.20 architecture, but 1M is the conservative documented figure for this specific model.
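If you want to plan against the conservative 1M figure, a simple pre-flight check can help. The sketch below is illustrative only: the function name is hypothetical, and it assumes the prompt and the reserved output share the same window, which is common but not confirmed for this model. Token counts should come from whatever tokenizer your client uses; none is included here.

```python
# Conservative documented figure from the answer above.
CONTEXT_WINDOW_TOKENS = 1_000_000


def context_headroom(prompt_tokens, max_output_tokens,
                     window=CONTEXT_WINDOW_TOKENS):
    """Return the tokens left over after the prompt and reserved output.

    A negative result means the request would not fit.
    Assumption: prompt and output both count against the same window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return window - (prompt_tokens + max_output_tokens)


if __name__ == "__main__":
    headroom = context_headroom(prompt_tokens=950_000, max_output_tokens=60_000)
    if headroom < 0:
        print(f"Over budget by {-headroom} tokens; trim the prompt or output cap.")
    else:
        print(f"Fits with {headroom} tokens to spare.")
```

Passing `window=2_000_000` lets you test the same request against the larger figure some sources cite.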

Question 3

What is the model genuinely bad at?

Accepted Answer

Multi-step logical reasoning is a documented weakness: a long-context reasoning score of 18% and limited chain-of-thought transparency make it a poor fit for problems that require auditable step-by-step logic. Community feedback also highlights gaps in complex code generation.

Question 4

How does it differ from the Grok-4.20 reasoning variant?

Accepted Answer

The reasoning variant runs extended chain-of-thought and is better suited for difficult logical or mathematical problems. This non-reasoning variant disables that process entirely, trading explainability for significantly higher output speed and lower cost per token.

Question 5

Does it have access to current information?

Accepted Answer

Yes. The Harper agent provides live integration with X (Twitter) data, allowing the model to draw on real-time event feeds. Its base knowledge cutoff is November 2024.

Question 6

Who should choose this model?

Accepted Answer

Teams building latency-sensitive production systems (streaming APIs, real-time chat, live content pipelines) that need strong factual accuracy but do not require visible reasoning traces or complex multi-step logical output.
