Question 1

What does it cost?

Accepted Answer

Standard context (up to 200K tokens) costs $5.00 per million input tokens and $25.00 per million output tokens. Using the extended 1M-token window doubles the input price to $10.00 per million, with output at $37.50 per million. Prompt caching brings reused-context costs down significantly.

Question 2

How large is the context window?

Accepted Answer

Up to 1 million tokens in beta — roughly 750,000 words per session. The standard, non-beta tier supports 200K tokens. Maximum output per request is 128K tokens.

Question 3

What is it best at?

Accepted Answer

Deep research workflows, large-codebase reasoning, and agentic task execution. It holds the top score on Terminal-Bench 2.0 (agentic coding), BrowseComp (information retrieval), and GPQA Diamond (91.3% on PhD-level science questions).

Question 4

Are there any notable weaknesses?

Accepted Answer

Output speed sits at 38.8 tokens per second, well below the frontier median of 62.3 t/s, which makes it feel slow for interactive conversations. Users have also reported that context quality degrades noticeably around 20-40% of the 1M-token window, not just at the ceiling.

Question 5

How does it compare to Claude Sonnet 4.6?

Accepted Answer

Opus 4.6 carries a higher intelligence index and leads on complex reasoning and agentic benchmarks, but costs significantly more and generates output more slowly. Sonnet 4.6 is the better fit for everyday tasks and higher-volume workflows where speed and cost matter.

Question 6

What is the knowledge cutoff?

Accepted Answer

Training data runs through May 2025, with reliable knowledge up to August 2025. Events between May 2025 and the February 2026 release date fall outside what the model can speak to with confidence.

Claude Opus 4.6

About Claude Opus 4.6

Best for

Specs & capabilities

Intelligence

Speed

Context window

Max output

Knowledge cutoff

Frequently asked questions

Related models