What is the rate limit for Claude Managed Agents?

60 requests per minute for create endpoints, 600 requests per minute for read endpoints. Organization-level limits still apply.

Are there volume discounts for Claude Managed Agents?

Volume discounts are available for high-volume users but are negotiated case-by-case through Anthropic enterprise sales.

What does web search cost in a Managed Agents session?

$10 per 1,000 searches ($0.01 per search), billed separately from session runtime and token costs.

Claude Managed Agents — Complete Pricing Reference 2026

Q: Is the $0.08/session-hour charge in addition to token costs?

Yes, in addition to. You pay both standard token rates for all input and output tokens, plus $0.08 per hour of active session runtime as separate line items.

You opened this tab because you need a number you can actually use. Not a vibe, not “it depends.” A real pricing breakdown you can put in a spreadsheet, a budget request, or a Slack message to your CTO.

This is that page. Every pricing variable for Claude Managed Agents in one place, verified against Anthropic’s current documentation as of April 2026. Bookmark it. The beta will update; so will this.

Quick Reference: The Formula

Total Cost = Token Costs + Session Runtime ($0.08/hr) + Optional Tools
Session runtime only accrues while status = running. Idle time is free.

The Two Cost Dimensions

Claude Managed Agents bills on exactly two dimensions: tokens and session runtime. Every pricing question you have collapses into one of these two buckets.

Dimension 1: Token Costs

These are identical to standard Claude API pricing. You pay the same rates you’d pay calling the Messages API directly. No Managed Agents markup on tokens. Current rates for the models most commonly used in agent work:

Claude Sonnet 4.6: ~$3/million input tokens, ~$15/million output tokens
Claude Opus 4.6: higher rates apply — check platform.claude.com/docs/en/about-claude/pricing for current figures
Prompt caching: same multipliers as standard API — cache hits dramatically reduce input token costs on long sessions with stable system prompts

The implication: a token-heavy agent with a large system prompt that runs the same context repeatedly benefits significantly from prompt caching, and that benefit carries over unchanged into Managed Agents.

Dimension 2: Session Runtime — $0.08/Session-Hour

This is the Managed Agents-specific charge. You pay $0.08 per hour of active session runtime, metered to the millisecond.

The critical word is active. Runtime only accrues while your session’s status is running. The following do not count toward your bill:

Time spent waiting for your next message
Time waiting for a tool confirmation
Idle time between tasks
Rescheduling delays
Terminated session time

This is not how you’d bill a virtual machine. It’s closer to how AWS Lambda bills — you pay for execution, not reservation. An agent that “runs” for 8 hours but spends 6 of those hours waiting on human input has a very different bill than one running continuous autonomous loops.

Optional Tool Costs

Web Search: $10 per 1,000 Searches

If your agent uses web search, each search costs $10/1,000 — that’s $0.01 per search. For most agents, this is negligible. For a research agent running hundreds of searches per session, it becomes a line item worth modeling separately.

Code Execution: Included in Session Runtime

Code execution containers are included in your $0.08/session-hour charge. You’re not separately billed for container hours on top of session runtime. This is explicitly stated in Anthropic’s docs and represents meaningful savings versus provisioning your own compute.

Worked Cost Examples

Example 1: Daily Research Agent

Runs once per day. 30 minutes of active execution. Processes 10 documents, outputs a summary report. Moderate token volume.

Session runtime: 0.5 hrs × $0.08 = $0.04/day (~$1.20/month)
Tokens (estimate): 50K input + 5K output with Sonnet 4.6 = ~$0.23/run (~$7/month)
Total: ~$8–10/month

Example 2: Weekly Batch Content Pipeline

Runs 3x/week. 2-hour active sessions. Processes multiple documents, generates structured outputs.

Session runtime: 2 hrs × $0.08 × 12 sessions/month = $1.92/month
Tokens: depends on content volume — typically $10–40/month
Total: ~$12–42/month

Example 3: Customer Support Agent (Business Hours)

Active during business hours, handling tickets. 8 hours/day active, 5 days/week.

Session runtime: 8 hrs × $0.08 × 22 days = $14.08/month in runtime
Tokens: highly variable by ticket volume — the dominant cost driver at scale
Runtime cost alone: ~$14/month — tokens are likely 5–20x this depending on volume

Example 4: 24/7 Always-On Agent

The maximum theoretical runtime exposure. Continuous operation, no idle time.

Session runtime: 24 hrs × $0.08 × 30 days = $57.60/month
In practice, no agent has zero idle time — real cost will be lower
Token costs at this scale become the dominant factor by a wide margin

Anthropic’s Official Example (from their docs)

A one-hour coding session using Claude Opus 4.6 consuming 50,000 input tokens and 15,000 output tokens: session runtime = $0.08. With prompt caching active and 40,000 of those tokens as cache reads, the token costs drop significantly. The runtime charge stays flat at $0.08 regardless of caching.

What’s Not Billed in Managed Agents

A few things that might seem like costs but aren’t:

Infrastructure provisioning: Anthropic handles hosting, scaling, and monitoring at no additional charge
Container hours: Explicitly not separately billed on top of session runtime
State management and checkpointing: Included in the session runtime charge
Error recovery and retry logic: Anthropic’s infrastructure problem, not yours

Rate Limits

Managed Agents has specific rate limits separate from standard API limits:

Create endpoints: 60 requests/minute
Read endpoints: 600 requests/minute
Organization-level limits still apply
For higher limits, contact Anthropic enterprise sales

How to Access Managed Agents Pricing

Managed Agents is available to all Anthropic API accounts in public beta. No separate signup, no premium tier gate. You need the managed-agents-2026-04-01 beta header in your API requests — the Claude SDK adds this automatically.

For high-volume agent applications, Anthropic’s enterprise sales team negotiates custom pricing arrangements. Contact them at [email protected] or through the Claude Console.

The Pricing Signals Worth Noting

Anthropic recently ended Claude subscription access (Pro/Max) for third-party agent frameworks, requiring those users to switch to pay-as-you-go API pricing. This signals a deliberate strategy: consumer subscriptions are for human-paced interactions; agent workloads route through the API. The $0.08/session-hour rate exists in that context — it’s infrastructure pricing for compute that runs beyond human attention spans.

The session-hour model also signals something about Anthropic’s infrastructure cost structure. They’re pricing on active execution time because that’s what actually taxes their systems. Idle sessions don’t cost them much; active agents do. The billing model follows the actual resource consumption pattern.

Frequently Asked Questions

Is the $0.08/session-hour charge in addition to token costs, or does it replace them?

In addition to. You pay both: standard token rates for all input and output tokens, plus $0.08 per hour of active session runtime. They’re separate line items.

Does prompt caching work in Managed Agents sessions?

Yes. Prompt caching multipliers apply identically to Managed Agents sessions as they do to standard API calls. If your agent has a large, stable system prompt, caching it can significantly reduce input token costs.

What happens if my session crashes? Am I billed for the crashed time?

Runtime accrues only while status is running. Terminated sessions stop accruing. Anthropic’s infrastructure handles checkpointing and crash recovery — the session state is preserved even if the session terminates unexpectedly.

Can I use Managed Agents on the free API tier?

Managed Agents is available to all Anthropic API accounts in public beta, but standard tier access and rate limits apply. Free API tier users receive a small credit for testing.

How does this compare to running agents on my own infrastructure?

See our full breakdown: Build vs. Buy: The Real Infrastructure Cost of Claude Managed Agents. Short version: the $0.08/hour is almost certainly cheaper than provisioning and maintaining equivalent compute, but you trade control and data locality for that simplicity.