Anthropic API Pricing: Every Model, Every Mode, What You’ll Actually Pay (2026)

The Anthropic API is how developers and businesses access Claude programmatically — and the pricing model is fundamentally different from the subscription tiers. Instead of a flat monthly fee, you pay per token, per model, per call. This is the complete breakdown of Anthropic API pricing as of April 2026: every model, every pricing mode, and how to calculate what you’ll actually spend.

The short version: Haiku is the cheapest and fastest. Sonnet is the workhorse. Opus is for complex reasoning where quality is the priority. The Batch API cuts all prices roughly in half for non-time-sensitive work. You prepay credits — no surprise bills.

Anthropic API Pricing by Model (April 2026)

All API pricing is per million tokens. Input tokens are what you send to the model; output tokens are what Claude returns. Output consistently costs more than input across all models.

Model	Input (per M tokens)	Output (per M tokens)	Best for
Claude Haiku	~$1.00	~$5.00	High-volume, latency-sensitive tasks
Claude Sonnet	~$3.00	~$5.00	Production workloads, content generation
Claude Opus	~$5.00	~$25.00	Complex reasoning, highest quality output

These are approximate figures — Anthropic publishes exact current rates on their pricing page and updates them with each model generation. Always verify before building cost projections into a production system.

What Is a Token?

A token is the unit of text the API processes. One token is roughly four characters of English text — or about three-quarters of a word. A 750-word article is approximately 1,000 tokens. A 10-page document might be 5,000–8,000 tokens depending on formatting.

Both your input (the prompt, system instructions, conversation history) and Claude’s output (the response) consume tokens. In a long multi-turn conversation, the entire conversation history is re-sent with each message — so token costs compound over long sessions.

The Batch API: ~50% Off for Non-Real-Time Work

Anthropic’s Batch API processes requests asynchronously and returns results within 24 hours. In exchange, you get roughly half off listed token rates across all models. This is the highest-leverage pricing lever available to developers running content pipelines, data processing, or any workload where real-time response isn’t required.

Model	Standard Input	Batch Input (~50% off)
Haiku	~$1.00/M	~$0.50/M
Sonnet	~$3.00/M	~$1.50/M
Opus	~$5.00/M	~$7.50/M

If you’re running more than 20 API calls that don’t need instant responses, the Batch API should be your default.

How API Billing Works

The Anthropic API does not operate on a subscription. You load prepaid credits into the Anthropic Console — your developer dashboard — and those credits draw down as you use the API. When credits run out, API calls stop until you add more. There’s no bill that arrives at the end of the month with a surprise on it.

Usage reporting in the Console shows a breakdown by model, by date, and by API key, so you can see exactly where token spend is going across different projects or team members.

Context Window and Pricing

Context window size affects how much you can send in a single API call — it doesn’t directly change pricing per token. However, larger context windows mean you can include more conversation history, longer documents, or more detailed system prompts, which increases input token counts and therefore cost per call.

Claude’s context windows as of April 2026 are generous across all tiers — Haiku, Sonnet, and Opus all support 200K token context windows, which covers most production use cases without forced truncation.

API vs. Subscription: Which Do You Need?

Use the API if: you’re building an application on top of Claude, running automated pipelines, integrating Claude into your own tools, or processing data programmatically.

Use Pro/Max if: you’re an individual using Claude through the web interface or Claude Code for your own work — not building something for others to use.

You might need both if: you use Claude daily for personal work (subscription) and also build Claude-powered tools for clients (API). They’re billed separately and don’t share limits.

Frequently Asked Questions

How much does the Anthropic API cost per month?

There’s no monthly fee for the API itself — you pay per token used. Costs depend entirely on which model you use, how many calls you make, and how long your prompts and responses are. Light usage on Haiku can cost just a few dollars. Heavy Opus usage for complex tasks costs significantly more. Load credits in advance via the Anthropic Console.

What is the cheapest Anthropic API model?

Claude Haiku is the least expensive model at approximately $1.00 per million input tokens. It’s optimized for speed and cost, making it the right choice for high-volume tasks where response quality doesn’t need to be at Opus level — classification, extraction, summarization, routing logic.

Does Anthropic offer API discounts for volume?

The Batch API offers roughly 50% off standard token rates for asynchronous workloads. For very high-volume usage, Anthropic also has enterprise agreements with custom pricing — contact their sales team. Standard token pricing doesn’t automatically tier down with volume outside of those two options.

How is Anthropic API pricing compared to OpenAI?

At the cheapest tier, OpenAI’s GPT-4o mini is less expensive per token than Claude Haiku. At the mid tier, Claude Sonnet and GPT-4o are in a similar range. At the top tier, Claude Opus and GPT-4o are comparable in price. The right choice depends on the task — not every model performs identically on every workload, so cost per token is only part of the calculation.

Do API tokens and subscription usage share limits?

No. API usage and Claude.ai subscription usage are entirely separate. Your Pro or Max subscription usage doesn’t count against API credits, and API credits don’t increase your subscription limits. They’re billed and tracked independently through different systems.