Question 1

How much does Claude Haiku 4.5 cost?

Accepted Answer

Input is $1.00 per million tokens and output is $5.00 per million tokens. Cached input drops to $0.10 per million tokens, a 90% discount that makes requests reusing the same long context significantly cheaper.
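To make the arithmetic concrete, here is a minimal sketch of a per-request cost estimate using the three rates quoted above. The function name `estimate_cost` and its parameters are hypothetical, and the sketch assumes cached tokens are simply billed at the cached rate instead of the standard input rate; it ignores any separate charge for writing to the cache, which this answer does not cover.

```python
# Rates in USD per million tokens, as quoted in the answer above.
INPUT_PER_MTOK = 1.00
OUTPUT_PER_MTOK = 5.00
CACHED_INPUT_PER_MTOK = 0.10


def estimate_cost(input_tokens: int, output_tokens: int, cached_input_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    cached_input_tokens is the portion of input_tokens served from cache;
    it is billed at the cached rate instead of the standard input rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_input_tokens
    total = (
        uncached_tokens * INPUT_PER_MTOK
        + cached_input_tokens * CACHED_INPUT_PER_MTOK
        + output_tokens * OUTPUT_PER_MTOK
    )
    return total / 1_000_000
```

Under these assumptions, a request with 50,000 input tokens (40,000 of them cached) and 2,000 output tokens comes to roughly $0.024, versus roughly $0.06 with no caching.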

Question 2

What are the context window and output limit?

Accepted Answer

It supports a 200,000-token context window. Maximum output per response is 64,000 tokens, which may be a constraint for very long document generation tasks.
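As a rough illustration of how the two limits interact, the sketch below checks whether a planned request fits. The helper name `fits_limits` is hypothetical, token counts are assumed to come from whatever tokenizer or counting tool you already use, and the check assumes the prompt and the generated output share the same 200,000-token window.

```python
# Limits as stated in the answer above.
CONTEXT_WINDOW_TOKENS = 200_000
MAX_OUTPUT_TOKENS = 64_000


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if the request respects both limits (hypothetical helper).

    Assumes prompt and output tokens count against the same context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS
```

A document generation job that needs more than 64,000 output tokens would fail this check regardless of prompt size, so it would have to be split across several responses.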

Question 3

How does it compare to Claude Sonnet 4.5?

Accepted Answer

Haiku 4.5 delivers roughly 90% of Sonnet 4.5's agentic coding capability at one-third the price and more than twice the speed. For most product workloads, the gap is hard to notice; for graduate-level reasoning benchmarks, Sonnet 4.5 still leads.

Question 4

Does it support reasoning and vision?

Accepted Answer

Yes. It's the first Haiku model to support extended thinking (controllable reasoning depth) and computer use. It also accepts image inputs alongside text.

Question 5

What is the knowledge cutoff?

Accepted Answer

February 2025. For most practical use cases this is recent enough, though a small number of competing models have a slightly more recent cutoff.

Question 6

Who should choose Haiku 4.5 over a larger model?

Accepted Answer

It's the right pick for teams building high-throughput products (chat, customer service, coding assistants, agentic pipelines) where response speed and token cost are the primary constraints and raw academic benchmark scores are secondary.
