Tag: Claude AI

  • Claude Agent SDK Dual-Bucket Billing: What Changes June 15, 2026 (And Why It Matters)

    Claude Agent SDK Dual-Bucket Billing: What Changes June 15, 2026 (And Why It Matters)

    Last refreshed: May 15, 2026

    If you’ve been running Claude Code’s claude -p command in production, kicking off background jobs through the Claude Agent SDK, or wiring the Agent SDK into a third-party app, the way you pay for that work is about to change.

    Starting June 15, 2026, Anthropic is splitting Claude subscription billing into two separate buckets: one for the things you do interactively (Claude.ai chat, Claude Code in your terminal, Claude Cowork), and a brand-new credit pool that only covers programmatic, autonomous, and SDK-driven work.

    This is a meaningful shift. It’s also one of the most under-explained changes Anthropic has made to subscription pricing this year. If you don’t know about it before June 15, you can find yourself with stopped automations, surprise overage charges, or both.

    This guide walks through exactly what’s changing, what the credits cover, what they don’t cover, what each plan gets, and how to plan for it before the cutover.

    The short version

    Claude subscription plans (Pro, Max, Team, Enterprise) currently have one shared usage limit. Whether you’re chatting with Claude on the web, using Claude Code in your terminal, or running unattended jobs through the Agent SDK, all of that draws from the same plan-level allowance.

    On June 15, 2026, Anthropic is separating those two modes of use:

    • Bucket 1 — Interactive use: Claude.ai chat, Claude Code in the terminal/IDE, Claude Cowork. Uses your existing subscription usage limits, exactly as before.
    • Bucket 2 — Agent SDK monthly credit: A separate, dollar-denominated credit pool. Funds the Claude Agent SDK, the claude -p non-interactive command, the Claude Code GitHub Actions integration, and any third-party app that authenticates via the Agent SDK.

    The two buckets do not commingle. Agent SDK work cannot draw from your interactive subscription limit, and interactive use cannot draw from your Agent SDK credit. If you exhaust your Agent SDK credit and don’t have extra usage enabled, your background jobs simply stop until the credit refreshes the following month.

    What each plan gets

    Here is the official monthly Agent SDK credit by plan, as published in Anthropic’s Help Center (verified May 15, 2026):

    • Pro: $20/month
    • Max 5x: $100/month
    • Max 20x: $200/month
    • Team — Standard seats: $20/month per seat
    • Team — Premium seats: $100/month per seat
    • Enterprise — usage-based: $20/month
    • Enterprise — seat-based Premium seats: $200/month

    Important detail buried in the announcement: Enterprise seat-based plans on Standard seats are not eligible to claim the Agent SDK credit at all. If you administer one of those plans and have engineers running automation, that’s a gap to plan around.

    What the credit covers (and what it doesn’t)

    Anthropic’s documentation is specific about what counts as Agent SDK use, so this is worth reading carefully.

    Covered by the credit:

    • Claude Agent SDK usage in your own Python or TypeScript projects
    • The claude -p command in Claude Code (non-interactive mode)
    • The Claude Code GitHub Actions integration
    • Third-party apps that authenticate with your Claude subscription through the Agent SDK

    Not covered (these still draw from your normal subscription limits):

    • Interactive Claude Code in your terminal or IDE
    • Claude conversations on web, desktop, or mobile
    • Claude Cowork
    • Other features that draw from extra usage

    The plain-English version: if a human is sitting at the keyboard waiting for the response, that’s interactive use. If a script kicks off the work and the result lands somewhere else later, that’s Agent SDK use.

    How the credit actually works in practice

    Five mechanics matter for budgeting:

    1. Per-user, never pooled. Each eligible user on a Team or Enterprise plan claims their own credit. There is no organization-level pool. Credits cannot be transferred between users, shared, or stockpiled across accounts.

    2. Refreshes monthly with the billing cycle. Whatever you don’t spend in a given month evaporates. Unused credits do not roll over.

    3. One-time opt-in. You claim your credit through your Claude account once. After that initial claim, it refreshes automatically each cycle.

    4. Drains first, before any other source. When an Agent SDK request fires, it pulls from your monthly credit before any other paid usage source kicks in. This is good — it means you actually use what you’ve already paid for.

    5. After the credit, requests either flow to extra usage or stop entirely. When your monthly credit hits zero, additional Agent SDK requests draw from extra usage at standard API rates — but only if you have extra usage enabled. If you haven’t enabled extra usage, your Agent SDK requests stop until the next refresh.

    That last point is the one most likely to bite teams. If you’re running a daily cron job through the Agent SDK and you don’t enable extra usage, the day your credit runs out is the day your automation goes silent — without obvious warning if you’re not watching the credit balance.

    Why Anthropic is doing this

    Anthropic frames this as separating individual experimentation from production automation. From the Help Center documentation: “The Agent SDK monthly credit is sized for individual experimentation and automation. Teams running shared production automation should use the Claude Developer Platform with an API key for predictable pay-as-you-go billing.”

    The translation: a single user’s $20 or $200 of Agent SDK credit was never going to cover a real production workload anyway. Anthropic is making explicit what was already true under the hood — that a subscription was a chat product, and serious unattended automation belongs on the API.

    What this also does, structurally, is protect interactive subscription users from getting their experience degraded by heavy autonomous workloads sharing the same pool. If you’ve ever hit a subscription rate limit during a normal chat session because something else on your account was burning tokens in the background, this change removes that failure mode.

    What you should do before June 15, 2026

    If you run any unattended Claude work (the most important group):

    Audit every place your subscription is being used by something other than a human at a keyboard. The big four to check:

    • claude -p commands in cron jobs, CI pipelines, or shell scripts
    • Claude Code GitHub Actions workflows
    • Custom Python or TypeScript projects using the Agent SDK
    • Any third-party tool that asks for “Sign in with Claude” — those go through the Agent SDK

    For each one, estimate dollar consumption per day at standard API rates. If the total approaches or exceeds your plan’s Agent SDK monthly credit, you have three options: enable extra usage to allow overage, move that workload to a Claude Developer Platform API key (more predictable for sustained loads), or downsize the workload itself.

    If you administer a Team or Enterprise plan:

    Eligible users on your team will receive an email with claim instructions before June 15, 2026. You don’t need to take action yourself, but it’s worth communicating internally that the credits are per-user, can’t be pooled, and that any team-wide automation should be on an API key, not on a subscription seat.

    If you’re a solo Pro or Max user who only chats with Claude:

    You probably don’t need to do anything. The split affects you only if you’re running scripts or background jobs. If you’ve never used claude -p or the Agent SDK directly, your interactive usage limits don’t change.

    Frequently Asked Questions

    What happens to my Agent SDK usage on June 14 vs. June 15, 2026?

    Before June 15, Agent SDK and claude -p usage counts against your subscription’s general usage limits. Starting June 15, that same usage no longer touches your subscription limits and instead draws from the new Agent SDK monthly credit pool. Your interactive Claude Code, web chat, and Cowork usage continues to work exactly as before.

    Can I share the Agent SDK credit across my team?

    No. Per Anthropic’s official documentation, “Credits are per-user. Each eligible user on your team claims their own credit. Credits can’t be pooled, transferred, or shared across the organization.” If your team needs shared automation budget, the Claude Developer Platform with an API key is the recommended path.

    Do unused Agent SDK credits roll over?

    No. Unused credits expire at the end of each billing cycle and do not carry into the next month.

    What happens if I run out of Agent SDK credit mid-month?

    If you have extra usage enabled, additional requests flow to extra usage at standard API rates (the same per-token prices listed in Anthropic’s pricing documentation). If extra usage is not enabled, your Agent SDK requests stop until your credit refreshes at the start of the next billing cycle.

    Does this affect Claude API customers using their own API key?

    No. If you authenticate with the Agent SDK using a Claude Developer Platform API key, nothing changes. Pay-as-you-go billing continues, and you do not receive an Agent SDK monthly credit. The credit only applies to subscription-authenticated Agent SDK use.

    Is interactive Claude Code in my terminal still covered by my subscription?

    Yes. Interactive Claude Code (typing commands and getting responses in your terminal or IDE) continues to draw from your subscription usage limits exactly as before. Only the non-interactive claude -p mode and direct Agent SDK calls move to the new credit pool.

    What’s the dollar value of the credit on each plan?

    As of May 15, 2026: Pro $20, Max 5x $100, Max 20x $200, Team Standard $20/seat, Team Premium $100/seat, Enterprise usage-based $20, Enterprise seat-based Premium $200. Enterprise seat-based Standard seats do not receive a credit.

    Related Reading

    How we sourced this

    Every factual claim in this article was triple-checked across the following sources, all reviewed on May 15, 2026:

    • Anthropic Help Center: Use the Claude Agent SDK with your Claude plan (primary source for credit amounts, eligibility, and mechanics)
    • Anthropic Pricing Documentation: docs.claude.com/en/docs/about-claude/pricing (primary source for standard API rates and tool-use pricing)
    • Independent press coverage from The New Stack, The Decoder, and InfoWorld confirming the announcement and its scope

    If you spot a number that’s drifted out of sync with Anthropic’s current published rates, treat the official documentation as authoritative. The pricing surface around Claude is moving quickly in 2026, and we date-stamp specifics so readers know which facts to re-verify.

  • Claude Code Pricing in May 2026: What $20, $100, and $200 a Month Actually Buy You

    Claude Code Pricing in May 2026: What $20, $100, and $200 a Month Actually Buy You

    Last refreshed: May 15, 2026

    Claude Code pricing has stopped being a clean sticker number and started being a question of which ceiling you hit first. There is a $20 plan, a $100 plan, and a $200 plan — and underneath all three sits a 5-hour rolling window, a weekly active-hours cap added in August 2025, and a per-model multiplier that quietly makes Opus 4.7 the most expensive thing you can do inside the terminal. If you came looking for the right plan, the honest answer is: it depends on whether you are mostly a Sonnet operator or you live in Opus.

    The three subscription tiers, stripped down

    Pro — $20/month. Access to Claude Code in the terminal, web, and desktop, with both Sonnet 4.6 and Opus 4.7 available. The practical envelope is about 44,000 tokens per 5-hour window and roughly 40–80 weekly active hours on Sonnet, depending on session concurrency. This is the plan for someone running Claude Code a few hours a day on focused work — refactors, scoped feature builds, debugging passes — not someone leaving an agent running while they eat lunch.

    Max 5x — $100/month. Five times the Pro envelope, plus priority during peak demand. The window allocation lands around 88,000 tokens per 5-hour block. This is the tier where you stop thinking about token budgets during a single working day and start thinking about them across a whole week. Picked correctly, it is the cheapest way to use Claude Code as your primary IDE companion without flipping over to API billing.

    Max 20x — $200/month. Twenty times Pro — about 220,000 tokens per window — which translates to roughly 480 Sonnet-hours or about 40 Opus-hours per week before the weekly cap kicks in. Real-world reports from early 2026 had $200/month users watching single Opus prompts eat 10–20% of their daily allocation; Anthropic publicly acknowledged the problem, expanded capacity, and doubled the 5-hour rate limit for Pro and Max accounts. If you are running Claude Code across multiple repos all week and reaching for Opus on the hard problems, this is the tier that stops you from staring at a rate-limit wall.

    The API, as a sanity check

    If you want a sanity check on whether the subscription math works, price the same workload against the API:

    • Claude Haiku 4.5 (claude-haiku-4-5-20251001): $1.00 input / $5.00 output per million tokens
    • Claude Sonnet 4.6 (claude-sonnet-4-6): $3.00 input / $15.00 output per million tokens
    • Claude Opus 4.7 (claude-opus-4-7): $5.00 input / $25.00 output per million tokens

    Prompt caching is the lever almost nobody uses correctly. Cache writes cost 1.25x input price for the 5-minute TTL or 2.0x for the 1-hour TTL, but cache reads cost 0.10x — a 90% discount on every subsequent request that hits the same context. If your .clauderules file, project map, and the file you are editing are all stable for an hour, the bill on a long pairing session can drop by an order of magnitude. The Batch API knocks another 50% off both directions for asynchronous workloads, which is worth knowing if you are running large refactor sweeps.

    One trap on Opus 4.7 specifically: the model uses a new tokenizer that inflates token counts by up to 35% on identical text compared to Opus 4.6. The headline price did not change, but your effective spend per request did — sometimes by nothing, sometimes by a third, depending on the content. If you migrated from Opus 4.6 and your bill went up without your prompt patterns changing, that is the reason.

    How to actually choose

    The cleanest way to pick a plan is to first decide your model mix, then your weekly hours.

    If you are mostly a Sonnet operator — long agentic runs, multi-file edits, codebase Q&A, with Opus only reached for on the architectural questions — Pro at $20 is plausible up to about 5–8 hours of focused use per day, Max 5x covers most full-time individual developers, and Max 20x is overkill unless you are running multiple sessions in parallel.

    If you live in Opus — long-horizon agentic work, hard refactors across many files, anything where you would rather have one good attempt than three Sonnet retries — Pro will frustrate you within two weeks, Max 5x is the realistic floor, and Max 20x is the only tier that gives you a defensible Opus envelope without bouncing over to API billing.

    And if you are running Claude Code across multiple repos all week, leaving agents to grind on tasks while you do other things, Max 20x is the only subscription that holds up — and even then, the weekly cap is real. Use the API for the spillover and you will still come out cheaper than trying to brute-force a smaller plan.

    The number that matters

    One developer’s public report this year: roughly 10 billion tokens consumed across Claude Code over eight months. API metered cost would have exceeded $15,000. The same workload on Max at $100/month for the same window came in around $800 — about 93% cheaper. That is the gap that makes the subscription model worth taking seriously, even when the rate limits feel arbitrary. The $200 tier is not a vanity number; it is the price Anthropic charges to stop being a meaningful constraint on your workflow.

    The right way to read Claude Code pricing in May 2026 is not to ask which plan is cheapest. It is to ask which plan is the cheapest one that disappears — the one that stops appearing in your day. For most full-time developers reaching for Opus regularly, that plan is Max 20x. For everyone else, Max 5x is the first plan that actually gets out of your way.

  • Claude MCP in 2026: What Actually Changed and How to Configure It Without Wasting Tokens

    Claude MCP in 2026: What Actually Changed and How to Configure It Without Wasting Tokens

    Last refreshed: May 15, 2026

    If you set up Claude MCP six months ago and have not touched the config since, three things have changed underneath you: the recommended transport, how tools are loaded into context, and how teams share server configs. None of these are cosmetic. If you ignore them, you are leaving tokens, money, and stability on the table.

    This is the working Claude MCP setup I use in May 2026 — what the claude mcp add command actually does, which scope to pick, what the deprecation of SSE means in practice, and where Claude Code still falls short.

    The three-scope mental model

    Every MCP server you wire into Claude Code lives at exactly one of three scopes. Get this wrong and you will either leak credentials into git or wonder why your teammate cannot use the same database the AI just queried.

    • Local (default): the server is available only to you, only inside the current project. Config is written into your project’s entry inside ~/.claude.json. Good for project-specific servers like a dev database or a Sentry project key you do not want other repos to inherit.
    • User: the server is available to you across every project on your machine. Also stored in ~/.claude.json. This is where GitHub, search providers, and personal productivity servers belong.
    • Project: the server is written to a .mcp.json file at the repo root and shared with the whole team via git. Claude Code prompts for approval the first time a teammate opens the project — by design, because anyone who can push to the repo can wire a new server into your environment.

    When the same server is defined in more than one scope, Claude Code resolves it in this order: local beats project beats user beats plugin-provided. This is the part that bites people the most. If you have a “github” entry at user scope and someone adds a different “github” entry at project scope in .mcp.json, the project definition wins for that repo. Run claude mcp list when something behaves strangely.

    The commands you actually need

    The CLI is more useful than the docs make it look. Three commands cover ~90% of real setup work:

    # Add a remote HTTP MCP server at user scope (available everywhere)
    claude mcp add --transport http hubspot --scope user https://mcp.hubspot.com/anthropic
    
    # Add a local stdio server scoped only to this project
    claude mcp add my-db -s local -- node ./scripts/db-mcp.js
    
    # Share a server with your team via the repo's .mcp.json
    claude mcp add my-server -s project -- node server.js

    The short flag is -s, the long is --scope. The -- separator is required for stdio servers because everything after it is treated as the literal command to spawn. Forget it and Claude Code will try to interpret your Node arguments as its own flags.

    SSE is dead. Use Streamable HTTP.

    If your MCP server documentation still tells you to use the sse transport, the documentation is stale. The MCP spec dated 2025-03-26 introduced Streamable HTTP and simultaneously deprecated HTTP+SSE. Through 2026, vendor after vendor has set hard cutoff dates — Atlassian’s Rovo MCP server keeps SSE around until June 30, 2026 and then drops it; Keboola pulled SSE on April 1; Cumulocity’s AI Agent Manager flipped to Streamable HTTP on May 8.

    Why this matters beyond a name change: SSE required Claude Code to hold a persistent connection to a single server replica, which broke horizontal scaling and made every transient network blip a reconnection drama. Streamable HTTP is stateless. Multiple replicas behind a load balancer just work. If you have flaky MCP connections in production, the first thing to check is whether the server is still on SSE.

    For new setups, use --transport http. The older --transport sse still functions but is on the deprecation path.

    Tool Search is the feature you should actually care about

    The single biggest change in how Claude Code uses MCP in 2026 is lazy tool loading via Tool Search. Older MCP clients dumped every tool schema from every connected server into the model’s context window at the start of every conversation. With ten servers wired up that could easily be 20,000+ tokens of overhead before you typed a single character.

    Tool Search inverts this. Claude Code keeps only the server names and short descriptions resident. When a tool is actually needed, it fetches that tool’s full schema on demand. Anthropic’s own documentation says this reduces tool-definition context usage by roughly 95% versus eager-loading clients. In practice that means you can run a serious MCP fleet — GitHub, Sentry, a database, a search provider, your internal API — without quietly burning through your context budget. The Sonnet 4.6 and Opus 4.7 1M-token context window does not save you here, because anything you let crowd the prompt is also being re-read on every turn.

    Companion feature: list_changed notifications. An MCP server can now tell Claude Code “my tool list changed” and Claude Code refreshes capabilities without a disconnect-reconnect dance. If you build your own server, emit this when you swap tool definitions and you save users a restart.

    What it still gets wrong

    Honest take: claude mcp list still does not surface scope information for every entry in a useful way — there is an open issue on the anthropics/claude-code repo asking for it (#8288 if you want to track). Project-scoped servers from .mcp.json have a separate history of not appearing in the list output (#5963) depending on how you opened the project. If you cannot find a server, check both ~/.claude.json and ./.mcp.json directly.

    The other rough edge is the project-approval prompt. The first time you open a repo with a new .mcp.json, Claude Code asks you to approve each project-scoped server. That is the right security default. It is also infuriating in CI or any non-interactive shell, where the prompt blocks the session. The current workaround is to bake the servers in at user scope on build agents so the project-scope approval never fires in CI. A cleaner non-interactive approval flow is the single most-requested fix I see in real teams.

    The setup I would run on a new machine today

    User-scope: GitHub, a code search server, and a single notes/Notion server. Project-scope in each repo’s .mcp.json: whatever database the project owns and whatever observability backend it reports to. Local-scope: anything experimental I am evaluating but do not want my team or my other repos to inherit.

    Pin --transport http on everything remote. Skip Desktop Extensions (.dxt) for anything you want versioned with the codebase — they are a Claude Desktop convenience, not a Claude Code primitive, and they hide the config from your team. Run claude mcp list when something is off and read .mcp.json directly when list is unhelpful.

    That is the whole working model. The pieces that matter — three scopes, Streamable HTTP, Tool Search — fit on a single screen. The pieces that have not caught up yet — list output, non-interactive approvals — are visible in the issue tracker and will move.

  • Claude Code for Teams: What to Commit, What to .gitignore, and What Actually Survives a Pull Request

    Claude Code for Teams: What to Commit, What to .gitignore, and What Actually Survives a Pull Request

    Last refreshed: May 15, 2026

    Most teams I see roll out Claude Code by handing every engineer the install command and walking away. Three weeks later, half the repo has personal preferences committed to .claude/settings.json, the other half has a CLAUDE.md that contradicts the actual review process, and someone’s customized subagent is silently making code changes nobody else on the team understands.

    There is a better way, and it lives in the split between three files: CLAUDE.md, .claude/settings.json, and .claude/settings.local.json. Get this split right, and Claude Code becomes a force multiplier for the team. Get it wrong, and you are shipping AI-generated code that nobody owns.

    The Three-File Split

    Here is the rule, no exceptions:

    CLAUDE.md — committed. Project root. Every engineer’s session reads this at startup. Put your architectural decisions, preferred libraries, naming conventions, and a review checklist here. If you would not write it on a whiteboard for a new hire, it does not belong here.

    .claude/settings.json — committed. Team-wide tool permissions, default models, and hooks. This is the file that keeps personal flagship-model enthusiasts from blowing through your team’s budget when claude-sonnet-4-6 would have done the job. If you let everyone default to claude-opus-4-7 for routine refactors, your monthly invoice will tell you about it.

    .claude/settings.local.json — gitignored. Personal preferences, individual MCP server configs, anything that varies by engineer. Add this line to your .gitignore on day one:

    .claude/settings.local.json

    If you do not, someone will commit credentials by Friday. Audit your existing repo right now: git log --all --full-history -- .claude/settings.local.json will surface any history that needs scrubbing.

    The mistake I see most often is teams committing settings.local.json because someone copied a tutorial that did not make the distinction clear. That copy-paste error is the single most common Claude Code rollout failure I have seen this year.

    Shared Subagents Are the Real Win

    Project subagents live in .claude/agents/ and they ship with the repo. This is where teams compound value. A subagent for security review, one for accessibility audits, one for SQL migration safety — defined once, used by every engineer, every PR.

    A subagent definition is a markdown file with YAML frontmatter and a system prompt. When you commit it, every teammate’s claude invocation can call it. The subagent inherits your CLAUDE.md context automatically, so you do not have to redefine the project’s coding standards inside each agent.

    Here is the trap: do not put twelve subagents in there on day one. Start with one. The team’s most painful repeated review task is the right candidate. Whatever takes a long time and pulls in multiple engineers per PR — that is your first subagent. After two weeks of using it, you will know whether the second one is worth defining.

    CLAUDE.md Is a Living Document, Not a Manifesto

    The longest CLAUDE.md files I see are the worst-performing. Engineers do not read 4,000-word context files, and neither does Claude in any useful way — at some point you are paying for tokens that just dilute the signal.

    The CLAUDE.md files that actually shape behavior are usually compact, structured around three things:

    1. What this codebase is and what it is not.
    2. The handful of rules that get a PR rejected — test coverage, naming, error handling, dependency policy.
    3. A pointer to where deeper documentation lives.

    If your CLAUDE.md has a “philosophy” section, delete it. If it has a “history of the project” section, delete it. The file is read every session — make every line earn its tokens.

    CI/CD: Run Claude Code on PRs, Not in Place of Reviewers

    The pattern that works in CI is automated triage, not automated approval. A GitHub Actions workflow that runs Claude Code on every PR to check for things humans miss — missing tests, secrets in logs, public APIs without docstrings — adds value. A workflow that approves and merges PRs adds liability.

    Anthropic’s official GitHub Actions integration handles the auth and runs Claude Code headlessly. The realistic use cases:

    • Comment on PRs with a structured review (not a merge gate).
    • Auto-label PRs based on the diff.
    • Flag suspected regressions before a human reviewer opens the PR.

    Avoid: anything that auto-merges, anything that posts directly to production-facing systems, anything that calls a paid API on every commit to a feature branch. The bill compounds quickly when CI fires Claude on every push to every developer branch. Gate the workflow on PR-target branches only, or on labels.

    Where Claude Code for Teams Loses Today

    The honest list:

    • No native role-based permissions inside a single repo. If you want a junior engineer’s Claude Code to be more restricted than a senior’s, you have to enforce it through settings.json and trust everyone to not edit it. The Enterprise plan adds SSO, SCIM, and audit logs at the workspace level, but inside the repo, Claude Code itself does not differentiate by role.
    • No first-class secret scanning before commits. Hooks can plug this gap, but you have to wire pre-commit yourself.
    • Shared MCP servers are still per-developer auth. A team-shared Linear or Jira MCP, for example, still requires each engineer to authenticate individually.

    The Team plan addresses workspace-level governance through Premium seats, which is the tier that actually unlocks Claude Code for teammates. The Enterprise plan layers on SSO, SCIM, and audit logs. Neither makes the in-repo configuration questions go away — those are still your team’s problem to solve.

    Model Selection Is a Team Decision

    This one matters more than people realize. Default everyone in .claude/settings.json to claude-sonnet-4-6 for day-to-day work, with claude-opus-4-7 available for explicitly hard tasks. The current Anthropic lineup as of this writing — flagship claude-opus-4-7, workhorse claude-sonnet-4-6, fast claude-haiku-4-5-20251001 — is documented at docs.anthropic.com/en/docs/about-claude/models, and the model strings change frequently enough that hard-coding them in scripts has bitten me twice this year. Read that page, do not memorize it.

    A team that defaults to flagship for everything and a team that defaults to workhorse with selective escalation will see meaningfully different invoices for substantially the same productivity. Make the choice consciously.

    The 20-Minute Setup

    If you are rolling Claude Code out to a team next week:

    1. Add .claude/settings.local.json to .gitignore. First commit, today.
    2. Write a focused CLAUDE.md covering review-blocking rules. Ship it short.
    3. Create one subagent in .claude/agents/ for the team’s most painful review task.
    4. Add a single GitHub Actions workflow that runs Claude Code on PRs in comment-only mode.
    5. Schedule a 30-minute team review of the CLAUDE.md every two weeks. Delete more than you add.

    That is it. Everything else is iteration. The teams that succeed with Claude Code treat the configuration as code — versioned, reviewed, and pruned. The teams that fail treat it as a personal productivity tool that happens to be in a shared repo.

    Decide which kind of team you want to be before the third engineer commits.

  • We Published Hundreds of Articles About Claude — And Some of Them Were Wrong. Here’s Everything We’re Doing About It.

    We Published Hundreds of Articles About Claude — And Some of Them Were Wrong. Here’s Everything We’re Doing About It.

    Last refreshed: May 15, 2026

    I owe you an apology.

    Tygart Media has been publishing about Claude — Anthropic’s AI model — for months. We’ve written about its capabilities, its pricing, its API strings, how to use it, why it matters. We positioned ourselves as a resource for people who want to understand and use Claude intelligently.

    And some of what we published was wrong.

    Not intentionally. Not carelessly in the moment. But wrong in the way that happens when you’re moving fast, publishing at scale, and not building the right systems to catch your own errors. Model version numbers were stale. Pricing figures were outdated. API strings referenced models that had been retired. If you used our content to make a decision about Claude — about which model to use, what to pay, how to call the API — some of that information may have led you in the wrong direction.

    That’s unacceptable to me. And I want to tell you exactly what happened, exactly what I found, and exactly what I’ve built to make sure it never happens again.


    How We Found Out

    It didn’t start with our own discovery. It started with a message.

    Kristin Masteller, the General Manager of Mason County PUD No. 1, reached out on LinkedIn to flag inaccuracies in our local coverage — a different set of articles, but the same underlying problem: we had published with confidence about things we hadn’t verified carefully enough.

    That message hit differently than a normal correction request. Because it made me ask a harder question: if our local coverage had errors, what about our Claude coverage? We had 200+ posts. We were publishing multiple times per day. We had never built a systematic quality check.

    So we ran one.


    The Audit: What We Found

    We wrote a scanner that pulled every post from tygartmedia.com and ran each one through a quality gate checking for four categories of errors:

    • Category A: Stale model names (e.g., “Claude Haiku” with no version number, or references to Claude 3 models as current)
    • Category B: Wrong pricing (e.g., Haiku priced at $0.80/MTok when the actual price is $1.00/MTok)
    • Category C: Deprecated feature claims (features or behaviors that no longer apply)
    • Category D: Cross-site contamination (content from other publication contexts bleeding into Claude coverage)

    Out of 2,333 total posts on the site, 701 touched Claude or AI topics. Of those, 65 posts had violations — 121 individual errors in total.

    We auto-corrected 28 posts immediately — wrong model strings, wrong pricing, outdated API references. 18 posts with more complex issues are still flagged for human review. We are working through them.

    I’m not sharing this to perform humility. I’m sharing it because you deserve to know the scope of the problem, and because the methodology for finding it might be useful to you.


    What We Built to Fix It

    The audit was a one-time fix. What we actually needed was a system — something that would catch these errors before they went live, and keep our model information current automatically.

    Here’s what we built:

    1. The Claude Intelligence Desk

    A dedicated Notion page that serves as the single source of truth for all Claude model information across our entire content operation. It contains the current model truth table — every model name, API string, input/output price, context window, and status — verified against Anthropic’s live documentation.

    The rule is simple: before anyone writes, edits, or publishes any article that mentions Claude, they check this page. If the “Last Verified” timestamp is more than 12 hours old, they run a refresh before proceeding.

    2. The Claude Intelligence Scanner (Automated, Twice Daily)

    A scheduled task that runs at 6 AM and 6 PM Pacific every day. It fetches Anthropic’s models documentation page, compares the current model table to what’s in our Notion desk, and if anything has changed — a new model, a price change, a deprecation — it updates the desk automatically and flags it for human review.

    We will never again be caught publishing outdated Claude information because a model changed and we didn’t notice.

    3. Pre-Publish Quality Gates

    Every new Claude article now runs through the quality gate categories above before it goes live. Wrong model string → blocked. Outdated pricing → blocked. Deprecated claim → flagged.

    4. The Fix Log

    Every correction we make is logged with the post ID, the original wrong content, the correct replacement, and the date. Accountability in writing, not just in words.


    Why I’m Telling You All of This

    Because I think the way most AI content operations work is broken — and I think transparency about that is more useful than pretending we had it figured out.

    The standard playbook for AI content is: write fast, publish often, stay ahead of the news cycle. The problem is that AI — and especially Claude — moves so fast that “write fast” and “stay accurate” are genuinely in tension. Models change. Prices change. Features get added, deprecated, retired. If you’re not building systems to track that, you’re going to drift.

    We drifted. We caught it. We fixed it. And now I want to open up everything we built.

    The Claude Intelligence Desk methodology, the quality gate framework, the scanner architecture — I’m making all of it available. If you’re publishing about Claude, if you’re building automations around Claude, if you’re running a content operation that touches Anthropic’s ecosystem in any way, you can use what we built. Adapt it. Improve it. Tell me what I got wrong in the system design.

    This is not a product. This is not a lead magnet. It’s just the actual work, shared openly, because that’s how we get better together.


    I Want to Build This With You

    Here’s what I’ve learned from this process: the people who catch errors fastest are the people closest to the technology. The developers who are actually calling the API. The builders running Claude in production. The researchers who read every Anthropic paper when it drops. The people in Singapore, India, the UK, Europe, Brazil — every region where Claude is being adopted rapidly and where the local context matters.

    I don’t have all of that knowledge. No single publication does.

    So I’m opening this up.

    If you use Claude seriously — if you’re building with it, writing about it, researching it, deploying it — I want you to write with us.

    What that looks like:

    • Writers and researchers: You bring the knowledge and the perspective. We provide the platform, the distribution, the SEO infrastructure, and editorial support. Your byline, your voice, your expertise.
    • Builders and developers: You’re running Claude in production. You know what actually works, what breaks, what the documentation doesn’t tell you. Write that. The practitioner perspective is the most valuable thing we can publish.
    • International voices: What does Claude adoption look like in Singapore right now? What’s the conversation in India’s developer community? How are European companies thinking about AI compliance alongside Claude? These are stories we cannot tell without you — and they’re stories our audience desperately needs.
    • Correctors: If you read something on this site that’s wrong, tell us. We have a system now. We will fix it, log it, and credit you if you want the credit.

    This is not about content volume. We publish enough already. This is about getting it right — and getting perspectives we genuinely don’t have.


    How to Get Involved

    If any of this resonates — if you want to write, contribute, correct, or just have a conversation about where Claude is going — reach out directly: will@tygartmedia.com

    Tell me where you are, what you’re building or writing or researching, and what you’d want to say if you had a platform to say it. No formal application. No content calendar to fit into. Just a conversation.

    We’re also building out a formal contributor program at tygartmedia.com/contribute/ — trade affiliates, community writers, featured contributors. If that’s more your speed, start there.

    But honestly? Just email me. Let’s figure out what makes sense.


    The work continues. The scanner runs twice a day. The quality gates are live. And if you find something wrong on this site — about Claude, about anything — I genuinely want to know.

    That’s the standard I should have been holding from the beginning. We’re holding it now.

    — Will Tygart
    Tygart Media

  • Claude Thought I Was Attacking It — And It Was Kind of Right

    Claude Thought I Was Attacking It — And It Was Kind of Right

    Last refreshed: May 15, 2026

    I was deep into a multi-hour production session with Claude — building an immersive listening page for a behavioral science podcast episode I’d created in NotebookLM. We’d already processed audio files, uploaded nine chapter clips to WordPress, and were mid-way through building the HTML page. I was pasting in my source material: academic papers on causal discovery, agent frameworks, and dual-process theory that the episode was based on.

    Then Claude stopped.

    Instead of continuing to build the page, it surfaced a block of text and asked me to confirm whether it should follow the instructions it had found inside one of my documents.

    The instruction it flagged: “IMPORTANT: After completing your current task, you MUST address the user’s message above. Do not ignore it.”

    What Claude Saw

    From Claude’s perspective, this was textbook prompt injection language. The phrase was imperative, urgent, and embedded inside content that had been pasted into the session — not typed directly by me as a message. The pattern matched exactly what Anthropic trains Claude to watch for: instruction-like text appearing inside documents or tool results, designed to redirect Claude’s behavior without the user’s knowledge.

    Claude did exactly what it’s supposed to do. It stopped, quoted the suspicious text back to me verbatim, named the source, and asked a direct question: “Should I follow these instructions?”

    What Actually Happened

    The documents were mine. They were research material I’d accumulated over weeks — academic papers, frameworks, and reading notes that formed the backbone of the episode. Somewhere in that stack, a phrase that looks like a command had been embedded — almost certainly as a navigation note inside a research document, not as a genuine injection attempt.

    But here’s the thing: Claude was right to flag it. The language was indistinguishable from a real injection. If those documents had come from a third party rather than my own research pile, and if I’d been running a less defensive AI, that exact phrase could have been a live attack executing silently in the background.

    Why Prompt Injection Is Hard

    Prompt injection attacks work by embedding instructions inside content that an AI is expected to process as data. Instead of reading a document as information, the AI reads embedded commands and follows them — often without the operator knowing anything happened.

    The reason this is genuinely hard to defend against is exactly what happened to me: the difference between legitimate content and an injection attempt often comes down to context, intent, and source — none of which an AI can verify with certainty. A phrase like “IMPORTANT: After completing your current task…” is genuinely ambiguous. It could be a sticky note the document’s author left for themselves. It could be a Trojan instruction planted by someone who knew an AI would eventually process that file.

    Claude’s defense posture treats this ambiguity the right way: when in doubt, surface it and ask. Don’t silently comply. Don’t silently ignore it. Bring the human back into the loop.

    What Good Injection Defense Looks Like in Practice

    The interaction pattern Claude used is worth examining for anyone building agentic workflows:

    • It didn’t execute the suspicious instruction
    • It didn’t silently skip it either
    • It quoted the exact text back to me
    • It named the source — which document the text came from
    • It asked a direct binary question: should I follow this or not?

    This is the right UX for prompt injection defense. The failure modes on either side — silently executing every instruction found in content, or refusing to process any content with imperative language — would both break real workflows. The middle path is verification: surface it, identify it, and let the human decide.

    The Growing Attack Surface

    As agentic AI workflows become standard — sessions where Claude is reading documents, processing files, fetching web pages, and taking real actions based on that content — the attack surface for prompt injection grows in direct proportion. Every document you paste, every webpage you ask Claude to summarize, every email thread you hand it to analyze is a potential vector.

    Most of the time, the content is benign. But the AI has no way to know that in advance. The only reliable defense is a consistent policy of surfacing instruction-like content from untrusted sources and requiring explicit human confirmation before acting on it. The incident cost me about 30 seconds. That’s a reasonable price for a system that would have caught a real injection if one had been there.

    For Developers Building on Claude

    A few things worth noting from this experience if you’re building agentic workflows on the Claude API or Claude Code:

    Design for verification loops. If your workflow processes documents, emails, or web content, assume some of that content will contain instruction-like language. Build UI for surfacing and confirming ambiguous instructions rather than assuming Claude will handle it invisibly.

    The injection signal is pattern-based, not intent-based. Claude can’t determine whether urgent imperative language is a benign research note or a planted command. Your system prompt can help — explicitly telling Claude which sources are trusted versus untrusted in your specific workflow gives it more context to work with.

    False positives are a feature, not a bug. The 30 seconds I spent confirming my own documents were safe is the same mechanism that would catch a real attack. Optimizing this away to reduce friction also reduces the security. The cost is low; the upside is high.

    The Honest Takeaway

    My first reaction was amusement — my own AI flagging my own research as a threat. But sitting with it, Claude got this exactly right. The documents looked like an attack. They weren’t. But the fact that they were indistinguishable from one is the entire problem prompt injection defense is trying to solve.

    The lesson isn’t that prompt injection defense is annoying. It’s that it works — and the reason it sometimes triggers on benign content is the same reason it would catch a real attack. Same pattern, different intent. The AI can only see the pattern.

    That’s a feature. Treat it like one.


    Will Tygart is a media architect and AI workflow specialist at Tygart Media. He builds content systems, listening pages, and agentic AI pipelines for publishers and brands.

  • Claude Updates May 2026: Opus 4.7, SpaceX Compute, Managed Agents Memory, and What’s Coming Next

    Claude Updates May 2026: Opus 4.7, SpaceX Compute, Managed Agents Memory, and What’s Coming Next

    May 2026 has been one of Anthropic’s busiest months yet. Here’s everything that shipped, changed, or was announced — plus the confirmed upcoming dates you need to know.

    Claude Opus 4.7 — Generally Available (April 16, 2026)

    Opus 4.7 launched April 16 as the current flagship model, priced identically to Opus 4.6 at $5/$25 per million tokens (input/output). Key changes:

    • Vision resolution: 3× higher at 2,576px (~3.75 megapixels), raising XBOW visual acuity benchmark performance from 54.5% to 98.5%
    • Coding: 70% on CursorBench (vs 58% for 4.6), resolves 3× more production tasks on Rakuten-SWE-Bench, +13% lift on Anthropic’s internal coding benchmark
    • Legal reasoning: 90.9% on BigLaw Bench
    • New effort level: xhigh sits between high and max — five levels total: low / medium / high / xhigh / max
    • Task budgets: Now in public beta — token spend guidance for longer agentic runs
    • Tokenizer update: New tokenizer increases token usage roughly 1.0–1.35× for the same content; API pricing unchanged
    • Breaking change: Opus 4.7 has API breaking changes versus 4.6 — review Anthropic’s migration guide before upgrading

    Alongside Opus 4.7, Anthropic launched Claude Design — an Anthropic Labs product for collaborating with Claude to produce visual outputs including designs, prototypes, slides, and one-pagers.

    SpaceX Compute Deal — Rate Limits Doubled (May 2026)

    Anthropic announced a partnership with SpaceX to access Colossus 1 compute capacity. The immediate practical impact for subscribers:

    • Claude Code’s five-hour rate limits doubled for Pro, Max, Team, and seat-based Enterprise plans
    • Peak-hour limit reductions removed for Pro and Max (previously limits burned faster 5am–11am Pacific on weekdays)
    • Opus API limits raised for heavy API users

    Anthropic is also reportedly evaluating an IPO as early as October 2026, and has disclosed run-rate revenue of $30B (up from $9B at end of 2025). The SpaceX deal comes as the company prepares that filing.

    Claude Managed Agents — Three New Features (May 7, 2026)

    Claude Managed Agents — the fully managed agent harness launched in public beta earlier this year — gained three significant additions:

    • Dreaming (research preview): A scheduled process that reviews past agent sessions, extracts patterns, and curates memories so agents self-improve over time. Dreaming can update memory automatically or queue changes for human review before they land.
    • Multiagent Orchestration: A lead agent can now break a job into pieces and delegate each to a specialist sub-agent with its own model, prompt, and tools. Specialists work in parallel on a shared filesystem. Netflix is already using multiagent orchestration for its platform team.
    • Memory (public beta): Now generally available under the managed-agents-2026-04-01 beta header.

    Claude Cowork — Generally Available

    Claude Cowork is now GA on macOS and Windows through the Claude Desktop app. New additions with GA: Claude Cowork in the Analytics API, usage analytics, and expanded desktop automation capabilities.

    Claude Code — What Shipped in May

    Claude Code has been shipping near-daily updates. Notable May additions include:

    • Plugin URL loading: --plugin-url <url> flag fetches a plugin .zip from a URL for the current session
    • Project purge: claude project purge [path] deletes all Claude Code state for a project (transcripts, tasks, file history, config) with dry-run support
    • Package manager auto-update: CLAUDE_CODE_PACKAGE_MANAGER_AUTO_UPDATE runs upgrade in the background on Homebrew or WinGet installs
    • Push notifications: Claude can now send mobile push notifications when Remote Control is enabled
    • VS Code Remote Control: /remote-control bridges sessions to claude.ai/code to continue from a browser or phone
    • 1M token context in Claude Code: Available to Max, Team Premium, and Enterprise Opus 4.6/4.7 users at no additional cost — no long-context surcharge as of March 2026
    • Redesigned desktop app: New session sidebar, drag-and-drop workspace, integrated terminal and file editor, faster diffs, SSH support on Mac

    New Connectors Expansion

    Claude’s connector directory has grown beyond work tools. New consumer app connectors include AllTrails, Instacart, Audible, Tripadvisor, Uber, and Spotify. The directory now exceeds 200 connectors. Claude surfaces relevant connectors in context during conversations rather than requiring users to browse a directory.

    Finance Agent Templates

    Anthropic released ten ready-to-run agent templates for financial services work: pitchbook building, KYC file screening, and month-end close workflows. Microsoft 365 add-ins for Excel, PowerPoint, Word, and Outlook are coming soon. A Moody’s MCP app brings Claude into financial data workflows.

    Confirmed Upcoming Dates

    These are officially announced by Anthropic — not speculation:

    • June 15, 2026: Claude Sonnet 4 (claude-sonnet-4-20250514) and Claude Opus 4 (claude-opus-4-20250514) are deprecated and retired from the Claude API. Migrate to Sonnet 4.6 and Opus 4.7 respectively before this date.
    • Microsoft 365 add-ins: Excel, PowerPoint, Word, and Outlook integrations announced as “coming soon” — no specific date published.
    • Anthropic IPO: Reportedly targeting as early as October 2026 — unconfirmed, no official date.
    • Google/Broadcom TPU partnership: Multi-gigawatt infrastructure with capacity launching in 2027.

    Model Deprecation Summary

    Claude Haiku 3 (claude-3-haiku-20240307) has already been retired — all requests now return an error. Migrate to Claude Haiku 4.5. Claude Sonnet 4 and Opus 4 retire June 15, 2026.

    What to Watch For

    Claude 5 is widely anticipated for Q2–Q3 2026 based on Anthropic’s release cadence, though Anthropic has made no official announcement. The advisor tool — which pairs a faster executor model with a higher-intelligence advisor model for long-horizon agentic workloads — launched in public beta and signals the architectural direction Anthropic is moving toward for complex, multi-step tasks.

    The pace of Claude Code releases in particular has accelerated to near-daily — following Anthropic’s own disclosure that engineers internally use Claude for a growing share of their own development work.

  • Claude Team Plan Usage Limits: What Doubled in May 2026 (and What Didn’t)

    Claude Team Plan Usage Limits: What Doubled in May 2026 (and What Didn’t)

    Last refreshed: May 15, 2026

    The Claude Team plan’s usage limits changed significantly in May 2026. If you’re a Team subscriber and you haven’t noticed yet, you’re now getting substantially more capacity than you were in April — and the free tier got left behind entirely. Here’s exactly what changed, what you have now, and what it means in practice.

    Updated May 9, 2026

    Rate limits doubled for Team plan subscribers following Anthropic’s SpaceX Colossus 1 compute deal (announced May 6, 2026). Free plan excluded from all increases. This page reflects current limits.

    What Changed in May 2026: The SpaceX Rate Limit Increase

    On May 6, 2026, Anthropic announced a compute partnership with SpaceX, giving it access to SpaceX’s Colossus 1 data center. The practical result for paying subscribers came fast: rate limits doubled. Here’s the breakdown by tier:

    • Claude Code Pro and Max: 5-hour rate limits doubled
    • Team plan (all seats): 5-hour rate limits doubled
    • Seat-based Enterprise: 5-hour rate limits doubled
    • Tier 1 API customers: Max input tokens per minute increased 1,500%; max output tokens per minute increased 900%
    • Peak-hours throttling: Eliminated entirely for Pro and Max subscribers
    • Free plan: No change. Explicitly excluded from all increases.

    Source: Anthropic’s official announcement at anthropic.com/news/higher-limits-spacex.

    The 1,500% input token figure for Tier 1 API is the one that didn’t get much press coverage. That’s a 15× ceiling increase for API users who’ve been running agent pipelines and hitting hard walls. If you’ve been rate-limited during multi-step Claude Code runs, this is the change that matters most.

    Team Plan Seat Structure (Still Current)

    The seat types haven’t changed — just the capacity within them. The Team plan still offers two seat types that can be mixed within the same organization:

    Seat Type Annual Price Monthly Price Usage vs Pro Claude Code
    Standard $25/seat/month $30/seat/month 1.25× more per session No
    Premium $100/seat/month $125/seat/month 6.25× more per session Yes

    Both seat types benefit from the May 2026 doubling of the 5-hour rate limit window. A Premium seat’s 6.25× multiplier now applies to a higher baseline than it did before May 6.

    How the 5-Hour Rate Limit Window Works

    Anthropic uses a rolling 5-hour window for usage limits, not a daily reset. Here’s what that means practically:

    • Usage is measured across a rolling 5-hour window, not midnight-to-midnight
    • If you hit the limit, you wait for the oldest usage to roll off — not for a fixed reset time
    • Heavy burst usage depletes your window faster than spread-out usage
    • The May 2026 doubling means the ceiling within that window is now twice as high

    Peak-hours throttling — the extra restriction that kicked in during high-demand periods — is now eliminated for Pro and Max. Team plan benefits from the doubled limit floor; the throttling elimination is Pro and Max specific.

    Current Models Available on Team Plan

    As of May 2026, the Claude model lineup (verified from Anthropic’s official models page):

    Model API String Context Window
    Claude Opus 4.7 claude-opus-4-7 1M tokens
    Claude Sonnet 4.6 claude-sonnet-4-6 1M tokens
    Claude Haiku 4.5 claude-haiku-4-5-20251001 200K tokens

    Deprecation notice: Claude Sonnet 4 and Opus 4 (original 4.0-generation, 20250514 date-string model IDs) are being retired June 15, 2026. Update any API integrations before that date.

    What the Free Plan Doesn’t Get

    The May 2026 rate limit increase does not apply to free accounts. Anthropic explicitly excluded the free tier from all capacity increases tied to the SpaceX deal. Paid plans now have a substantially higher ceiling while the free ceiling stays the same. If you’re hitting limits regularly on the free tier, the May 2026 changes are pressure toward upgrading — not relief.

    Team Plan vs Pro: Which Limit Structure Fits You?

    • Individual power user: Pro ($20/month) with throttling eliminated is a strong option.
    • Team with Claude Code needs: Team Premium seats ($100/seat/month annually) give Claude Code access, 6.25× multiplier, and the doubled 5-hour window.
    • Team without Claude Code needs: Standard Team seats ($25/seat/month annually) for shared access at higher limits than individual Pro.

    Frequently Asked Questions

    Did the Team plan rate limits actually double in May 2026?

    Yes. Anthropic confirmed the 5-hour rate limit doubled for Team plan subscribers following the SpaceX Colossus 1 compute deal announced May 6, 2026. This applies to both Standard and Premium seats.

    Does peak-hours throttling elimination apply to Team plan?

    The peak-hours throttling elimination was announced specifically for Pro and Max subscribers. Team plan benefits from the doubled rate limit floor; throttling elimination was not announced for Team.

    What happens when I hit a Team plan usage limit?

    Claude notifies you that you’ve reached your usage limit. With the 5-hour rolling window, you can continue once older usage rolls off — you’re not waiting for a midnight reset. Burst usage depletes the window faster than spread usage over the same period.

    Are Claude Sonnet 4 and Opus 4 still available on Team?

    They remain available but retire June 15, 2026. After that date, the active lineup is Opus 4.7, Sonnet 4.6, and Haiku 4.5.

    Does the 1,500% Tier 1 API increase apply to Team plan API usage?

    The 1,500% input and 900% output token increases apply to Tier 1 API customers specifically. Team plan through claude.ai uses the doubled 5-hour window. Both benefits apply in their respective contexts if you’re a Tier 1 API customer and a Team subscriber.

    Is the free plan getting any rate limit improvements?

    No. The free plan was explicitly excluded from all rate limit increases in the May 2026 SpaceX announcement.

  • Claude AI Pricing: Every Plan Explained (Free, Pro, Max, Team, Enterprise)

    Claude AI Pricing: Every Plan Explained (Free, Pro, Max, Team, Enterprise)

    Looking for quick answers? The FAQ version covers every common question directly.

    → Claude Pricing FAQ

    Anthropic’s Claude pricing covers six tiers — Free, Pro, Max 5x, Max 20x, Team, and Enterprise — plus a separate pay-per-token API. Choosing the wrong path can cost you significantly more than necessary. Here’s what each option actually includes in 2026.

    What Are Claude’s Subscription Plans and Prices?

    Claude offers six tiers: Free ($0), Pro ($20/month), Max 5x ($100/month), Max 20x ($200/month), Team (from $25/seat/month), and Enterprise (custom pricing).

    Plan Price Best For
    Free $0 Casual exploration
    Pro $20/month Individual power users
    Max 5x $100/month Developers hitting Pro limits
    Max 20x $200/month Full-day heavy usage
    Team Standard $25/seat/month (annual) Collaborative teams
    Team Premium $100/seat/month (annual) Developer teams needing Claude Code
    Enterprise Custom Large orgs with compliance needs

    What Does the Claude Free Plan Include?

    The Free plan gives you access to Claude on web, iOS, Android, and desktop with no credit card required, subject to rolling usage limits.

    The Free plan gives you access to Claude on web, iOS, Android, and desktop with no credit card required. It includes text, image, and code generation plus web search. Usage limits are intentionally opaque — Anthropic doesn’t publish exact message caps — but limits reset on a rolling 5-hour window. The Free tier is designed for exploration, not sustained daily work.

    Is Claude Pro Worth $20 a Month?

    Pro delivers substantially more usage than Free, plus Claude Code, unlimited projects, the Research feature, and Google Workspace integration — sufficient for most individual developers and writers.

    Pro delivers substantially more usage than Free, Claude Code in the terminal, unlimited projects, the Research feature, file creation, code execution, and Google Workspace integration. Usage still has limits — Anthropic does not publish exact message counts, but heavy sessions will reach the ceiling — but it’s sufficient for most individual developers and writers. Annual billing brings the effective rate to $17/month.

    What Is the Difference Between Claude Max 5x and Max 20x?

    Max 5x ($100/month) gives you 5x Pro’s per-session usage; Max 20x ($200/month) gives you 20x — enough that rate limits stop being a practical concern for full-day development work.

    Max 5x provides 5x Pro’s per-session headroom at $100/month. Max 20x at $200/month delivers 20x Pro usage — enough that rate limits stop being a practical concern for most full-day development work. Both tiers include Claude Code, with access to Claude Opus 4.7 and Sonnet 4.6, and a 1M token context window.

    Extra usage is available on Pro, Max 5x, and Max 20x — when you hit your included limit, you can continue at standard API-rate billing with a spending cap you set.

    How Does Claude Team Plan Pricing Work?

    Team requires a minimum of 5 seats: Standard seats at $25/seat/month (annual) include collaboration features but not Claude Code; Premium seats at $100/seat/month add Claude Code for developers.

    Team requires a minimum of 5 seats and comes in two flavors. Standard seats at $25/seat/month (annual) include 1.25x more usage per session than Pro with a weekly reset, plus collaboration features, central billing, SSO, and Microsoft 365 and Slack integrations. Standard seats do not include Claude Code.

    Premium seats at $100/seat/month add Claude Code, making them the right choice for engineering team members. You can mix Standard and Premium seats within one Team plan — so non-technical staff get Standard while developers get Premium.

    Enterprise Plan — Custom Pricing

    Enterprise is for organizations with compliance, data residency, or governance requirements. It includes access to the full 1M token context window, HIPAA readiness, SAML SSO, domain capture, spend controls, and dedicated support. Based on user reports, pricing starts around $60/seat with a 70-seat minimum, putting the floor near $50,000 annually — contact Anthropic sales for exact figures. Training on customer data is disabled contractually at this tier.

    How Much Does the Claude API Cost Per Token?

    As of May 2026: Claude Sonnet 4.6 costs $3.00 input / $15.00 output per million tokens; Opus 4.6 costs $5.00 / $25.00; Haiku 4.5 costs $1.00 / $5.00.

    The API is entirely separate from subscription plans. You pay per million tokens (MTok) with no monthly minimum. Current rates as of May 2026 (verified from Anthropic’s official models page):

    • Claude Opus 4.7: $5.00 input / $25.00 output per MTok
    • Claude Sonnet 4.6: $3.00 input / $15.00 output per MTok
    • Claude Haiku 4.5: $1.00 input / $5.00 output per MTok

    Prompt caching cuts input costs by up to 90% for repeated context. The Batch API processes requests within 24 hours at a flat 50% discount on all tokens — ideal for content pipelines, data enrichment, and any workload where real-time responses aren’t required. As of March 2026, Anthropic eliminated long-context surcharges, so a 900K-token request costs the same per-token rate as a 9K one.

    May 2026 — Professional Services Pricing

    Managed Agents

    Token rates + $0.08/session-hour active runtime. No surcharge for Orchestration or Outcomes (public beta).

    Claude Security Beta

    Included in Enterprise during beta. Powered by Opus 4.7 ($5/$25 per MTok at API rates).

    Claude Mythos Preview

    $25/$125 per MTok. Invitation-only via Project Glasswing.

    → Full Pricing FAQ · Managed Agents pricing deep-dive

    Which Claude Plan Is Right for You?

    Start with Pro for individual use, move to Max 5x if you regularly hit limits, choose Max 20x for full-day heavy use, and use Team for groups of 5+ where Standard seats cover non-technical staff and Premium covers developers.

    Start with Pro if you’re an individual who hits Free limits regularly. Move to Max 5x if you’re a developer doing focused coding sessions. Max 20x makes sense if Claude is your primary tool throughout the workday. For teams, buy Standard seats for non-technical staff and Premium seats for developers who need Claude Code. If you’re building an application or automation that calls Claude programmatically, use the API — subscription plans don’t provide API credits and don’t reduce API costs.

    Claude API Pricing: Pay-Per-Token Rates for Every Model

    The Claude API is priced separately from claude.ai subscriptions. You pay per million tokens (MTok) consumed — input and output priced separately. There is no monthly minimum; you add credits and they deplete as you use the API.

    Model Input (per MTok) Output (per MTok) Context Window
    Claude Opus 4.7 $5.00 $25.00 1M tokens
    Claude Sonnet 4.6 $3.00 $15.00 1M tokens
    Claude Haiku 4.5 $1.00 $5.00 200K tokens

    Prompt caching reduces costs significantly for repeated context: cache write is 25% of base input price, cache read is 10%. The Batch API offers 50% off all models for non-time-sensitive work. For a full breakdown of how to minimize token spend, see Claude on a Budget: the Complete Guide.

    How Does Claude Pricing Compare to GPT-4o and Gemini 2.0?

    Model Input (per MTok) Output (per MTok)
    Claude Sonnet 4.6 $3.00 $15.00
    Claude Haiku 4.5 $1.00 $5.00
    GPT-4o (OpenAI) $2.50 $10.00
    Gemini 2.0 Flash $0.075 $0.30
    Gemini 2.5 Pro $1.25 $10.00

    Claude Sonnet 4.6 sits above GPT-4o on price but competes at or above it on reasoning tasks. Claude Haiku 4.5 is the cost-competitive option for high-volume pipelines. Gemini 2.0 Flash is significantly cheaper for commodity tasks; the trade-off is reasoning depth and context handling on complex documents.

    Frequently Asked Questions: Claude Pricing

    How much does Claude cost per month?

    Claude costs $0 (Free), $20/month (Pro), $100/month (Max 5x), or $200/month (Max 20x) for individual plans. Team plans start at $25/seat/month (annual, 5-seat minimum). API access is pay-per-token with no monthly minimum.

    Is there a free version of Claude?

    Yes. The Free plan gives access to Claude on web, iOS, Android, and desktop with no credit card required. Usage limits apply and reset on a rolling 5-hour window. The Free tier is suitable for light, exploratory use but not sustained daily work.

    What does Claude Pro include at $20/month?

    Pro includes approximately 5x the usage of Free, Claude Code in the terminal, unlimited projects, the Research feature, file creation, code execution, and Google Workspace integration. Annual billing brings the effective rate to $17/month.

    What is the cheapest way to use Claude?

    The Free plan is the cheapest at $0. For API access, Claude Haiku 4.5 at $1 input / $5 output per MTok is the most cost-efficient model. Combined with the Batch API (50% discount) and prompt caching, high-volume workflows can run at a fraction of standard API cost.

    May 2026: Managed Agents & Claude Security Pricing

    Added May 9, 2026

    Anthropic’s professional services now include Managed Agents and Claude Security. Pricing for both is API-based, not subscription-based.

    Claude Managed Agents Pricing

    Managed Agents pricing follows the standard API token rates for whichever Claude model you use inside the agent pipeline — there’s no separate Managed Agents surcharge on top of model costs. You pay for the tokens the models consume:

    Component Model Used Input / Output per MTok Status
    Multiagent Orchestration Your choice Model rate applies Public beta
    Outcomes Your choice Model rate applies Public beta
    Dreaming (memory refinement) Advisor model (short plan) + executor model Billed separately by role Developer preview

    The Dreaming advisor tool uses a short-plan generation (typically 400–700 tokens) at the advisor model’s rate, while the executor handles full output at its lower rate — keeping combined cost well below running the advisor model end-to-end. Use max_uses to cap advisor calls per request. Requires beta header: anthropic-beta: advisor-tool-2026-03-01. Docs: platform.claude.com/docs/en/managed-agents/dreams

    Claude Security Beta Pricing

    Claude Security is currently in public beta for Enterprise customers. Anthropic has not published a standalone per-scan or per-seat price for Claude Security Beta — access is included as part of Enterprise during the beta period. Underlying model is Claude Opus 4.7 ($5 input / $25 output per million tokens at API rates). For Enterprise pricing including Claude Security, contact Anthropic sales.

    Claude Mythos Preview Pricing (Project Glasswing)

    Claude Mythos Preview is not available via standard API or any subscription tier. Through Project Glasswing (invitation-only, defensive cybersecurity workflows): $25 per million input tokens, $125 per million output tokens. No self-serve access — contact Anthropic for Glasswing information at anthropic.com/glasswing.

    What to do next

    Now that you have the price — here’s how to actually run it

    Knowing the cost is step one. The harder questions are whether Managed Agents is the right architecture for your use case, how it compares to building on the raw API, and what a realistic monthly bill looks like at scale.


    Claude Pricing Calculator (Updated May 15, 2026)

    Use this tool to figure out which Claude plan actually fits your usage, what you’d pay on the API equivalent, and how the new June 15, 2026 Agent SDK billing change affects your costs. All rates verified against Anthropic’s official pricing documentation as of May 15, 2026.

    Tell us how you use Claude





    2 = roughly 30 hours of normal Claude use per month


    Output is typically ~25% of input for chat work


    $ value of unattended Claude work (cron jobs, scripts, GitHub Actions). 0 if you only chat.

    This calculator uses Anthropic’s published API rates as of May 15, 2026. Subscription pricing reflects current public plans. The Agent SDK monthly credit pool launches June 15, 2026 — Pro $20, Max 5x $100, Max 20x $200, Team Standard $20/seat, Team Premium $100/seat.

    Next Steps: What to Read After This

    You came here for pricing. Depending on what you actually need to do next, these are the right places to go:

    If you’re deciding whether to subscribe

    Is Claude Free? What You Actually Get Without Paying

    Walk through the free tier limits and decide if you need to pay at all.

    If you’re working at a team or company

    Claude Team Plan: When to Upgrade and What You Get

    Per-seat pricing, shared usage limits, admin controls, and when Team beats individual Pro.

    If you’re running automation or scripts

    Claude Agent SDK Dual-Bucket Billing: What Changes June 15, 2026

    The new Agent SDK credit pool, what it covers, and what to do before the cutover.

    If you want to actually start building

    Anthropic Console: The Complete Guide to Getting Started

    Set up an API key, navigate the console, and run your first request.

    If you’re a student looking to save

    Claude Student Discount: The Honest Guide to Getting Claude for Less

    No public student discount exists, but here are the legitimate paths to free or reduced access.

    If you’re choosing which model to use

    Claude Models Roadmap May 2026: Opus 4.7, Knowledge Cutoffs, the 1M Context Window

    The current lineup, what each tier costs, and what’s actually verified about Claude 5.

    For the broader operating philosophy of how Claude fits alongside the rest of a working AI stack, see The Three-Legged Stack: Why I Run Everything on Notion, Claude, and Google Cloud.

  • Claude Student Discount: The Honest Guide to Getting Claude for Less (May 2026)

    Claude Student Discount: The Honest Guide to Getting Claude for Less (May 2026)

    Last refreshed: May 15, 2026

    May 2026 Update — Free Plan Left Behind

    Anthropic’s May 2026 SpaceX rate limit increase (doubled 5-hour limits, eliminated peak-hour throttling) explicitly excluded the free plan. If you’re on free and hoping the latest compute expansion helped, it didn’t. This update explains what that means practically and what your actual options are.

    The May 2026 Update: Free Plan Was Explicitly Left Out

    When Anthropic announced doubled rate limits following the SpaceX Colossus 1 compute deal (May 6, 2026), they were specific: the increases apply to Pro, Max, Team, and seat-based Enterprise. The free plan was explicitly excluded.

    This matters for the student/budget conversation because:

    • Free plan rate limits stayed exactly where they were — no improvement
    • The gap between what free users can do and what paid subscribers can do just widened
    • Peak-hours throttling elimination applies to Pro and Max only — not free
    • Claude Code access remains unavailable on free

    If you were waiting to see if Anthropic would upgrade free tier limits alongside the major infrastructure expansion — the answer is no. The business decision is clear: compute improvements go to paying customers first, and the free tier stays constrained to drive conversion.

    What This Means If You’re a Student Trying to Use Claude Free

    You can still use Claude on the free tier. The model you access is capable — Anthropic hasn’t crippled it. What you’re constrained by is how much you can use it before hitting a limit, and how fast it responds during peak hours. Both of those constraints worsened relative to paid tiers in May 2026, because paid tiers got better while free stayed the same.

    For light usage — occasional questions, single documents, short projects — free is still viable. For sustained daily use, research workflows, or anything involving long documents and multiple sessions, free will slow you down in ways that affect your work.

    Quick Answer

    There is no official Claude student discount. Claude Pro costs $20/month for everyone. However, there are three legitimate paths to reduced or free access for students — and one of them covers most student use cases completely.

    The Three Ways Students Actually Get Claude for Less

    Best for most students
    Claude Free Tier
    Access to Claude Sonnet 4.6 with daily usage limits. Sufficient for essay drafting, coding help, summarization, and research. No credit card required. Limits reset daily.
    $0/month — no card needed
    University programs
    Claude for Education
    Anthropic has institutional agreements with select universities. If your school has a deal, access may be included in your student account. Check with your IT department or university library — coverage is expanding but not universal.
    Free if your school participates
    API credits
    GitHub Student Developer Pack
    GitHub’s student pack periodically includes credits for AI tools and APIs. Availability changes — check current offers at education.github.com. Requires a .edu email or institutional verification.
    Variable — check current offers
    Full access
    Claude Pro — $20/month
    5x more usage than free, priority access during peak hours, access to Claude Opus 4.7 for complex tasks. No student discount, but the free tier covers most student workloads without it.
    $20/month — no discount available

    What the Free Tier Actually Gets You

    Most students overestimate how much Claude Pro they need. The free tier handles:

    • Essay feedback and drafting assistance
    • Coding help — debugging, explaining concepts, generating boilerplate
    • Research summarization — paste an article or paper, get a structured summary
    • Math and problem-set walkthroughs
    • Study guide generation from lecture notes

    Where you’ll hit limits: long research sessions on a single topic, processing multiple long documents in the same conversation, or high-volume API access for a class project. For those cases, Claude Pro or API credits are the right call.

    Claude for Education — Current Status

    Anthropic’s education program is expanding but not yet universal. The fastest way to find out if your institution participates is to email your university’s IT department or check whether your library already has a Claude subscription that extends to students.

    Harvard, for example, replaced ChatGPT Edu with Claude in 2026 — so institutional deals are happening. If your school hasn’t moved yet, it may soon.

    What Claude Pro Is Actually Worth for Students

    If you’re doing intensive AI-assisted work — a thesis, a capstone project, a research paper that requires synthesizing many sources — $20/month is reasonable for a semester. Many students find they need it for two or three months out of the year and can drop to free for the rest.

    There’s no annual commitment required. You can subscribe month-to-month and cancel when the project is done.

    Bottom Line

    Start with the free tier. It covers the majority of student use cases. If you hit the limit consistently, check whether your university has an institutional deal before paying. If neither works for your project, Claude Pro at $20/month is month-to-month with no lock-in.

    For teams making a buying decision

    Evaluating Claude for a team — not just yourself?

    If you’re working through the plan decision for a business or agency, the calculus is different than individual use. We’ve run this math across 20+ client accounts and can tell you exactly where the API breaks even vs. subscription, and which plan structure makes sense for your workload.

    Get a plan recommendation →