Tag: Claude

  • Claude Code Hooks: The Workflow Control Layer That Actually Enforces Your Rules

    Claude Code Hooks: The Workflow Control Layer That Actually Enforces Your Rules

    Last refreshed: May 15, 2026

    You’ve been there. You add a rule to CLAUDE.md — “always run prettier after editing files” — and Claude follows it, most of the time. Then it doesn’t. The formatter doesn’t run, the lint check gets skipped, and you’re back to reviewing diffs manually.

    Hooks fix this. Claude Code hooks are shell commands, HTTP endpoints, or LLM prompts that fire deterministically at specific points in Claude’s agentic loop. Unlike CLAUDE.md instructions, which are advisory, hooks are enforced at the execution layer — Claude cannot skip them.

    As of early 2026, Claude Code ships with 21 lifecycle events across four hook types. This article covers the two that matter most for daily workflow: PreToolUse and PostToolUse.

    How Hooks Work Architecturally

    Claude Code’s agent loop is a continuous cycle: receive input → plan → execute tools → observe results → repeat. Hooks intercept this loop at named checkpoints.

    Every hook is defined in .claude/settings.json under a hooks key. A hook entry has three parts: the lifecycle event name, an optional matcher (a regex against tool names), and the handler definition — either a shell command, an HTTP endpoint, or an LLM prompt.

    {
      "hooks": {
        "PostToolUse": [
          {
            "matcher": "Write|Edit",
            "hooks": [
              {
                "type": "command",
                "command": "npx prettier --write "$CLAUDE_TOOL_INPUT_FILE_PATH""
              }
            ]
          }
        ]
      }
    }

    That’s it. Every file Claude writes or edits now auto-formats. No CLAUDE.md reminders, no hoping Claude remembers — the formatter runs on every single Write or Edit tool call, period.

    PreToolUse: Enforce Before Claude Acts

    PreToolUse fires before Claude executes any tool. Your hook receives the full tool call — name, inputs, arguments — and can return one of three signals:

    • Exit 0 → allow the tool call to proceed
    • Exit 2 → block the tool call; Claude receives your error message and adjusts
    • Exit 1 → hook error; Claude proceeds but logs the failure

    This makes PreToolUse the right place for guardrails. Here’s a real example: blocking npm in a bun project.

    #!/bin/bash
    # .claude/hooks/check-package-manager.sh
    # Blocks npm commands in projects that use bun
    
    if echo "$CLAUDE_TOOL_INPUT_COMMAND" | grep -qE "^npm "; then
      echo "Error: This project uses bun, not npm. Use: bun install / bun run / bun add" >&2
      exit 2
    fi
    exit 0

    Wire it in settings.json:

    {
      "hooks": {
        "PreToolUse": [
          {
            "matcher": "Bash",
            "hooks": [
              {
                "type": "command",
                "command": ".claude/hooks/check-package-manager.sh"
              }
            ]
          }
        ]
      }
    }

    Now when Claude tries npm install, the hook exits 2, Claude sees the error message, and it switches to bun install without you intervening. The correction happens in the same turn.

    Another production pattern: blocking writes to protected paths.

    #!/bin/bash
    # Prevent Claude from modifying migration files already run in production
    if echo "$CLAUDE_TOOL_INPUT_FILE_PATH" | grep -qE "db/migrations/"; then
      echo "Error: Migration files are immutable after deployment. Create a new migration instead." >&2
      exit 2
    fi
    exit 0

    PostToolUse: React After Claude Acts

    PostToolUse fires after a tool completes successfully. It can’t block execution, but it can provide feedback — and it can run any side-effect you need automatically.

    Auto-format every edit:

    {
      "hooks": {
        "PostToolUse": [
          {
            "matcher": "Write|Edit",
            "hooks": [
              {
                "type": "command",
                "command": "npx prettier --write "$CLAUDE_TOOL_INPUT_FILE_PATH" 2>/dev/null || true"
              }
            ]
          }
        ]
      }
    }

    Run tests after code changes:

    #!/bin/bash
    # Run affected tests after any source file edit
    FILE="$CLAUDE_TOOL_INPUT_FILE_PATH"
    if echo "$FILE" | grep -qE "\.(ts|js|py)$"; then
      if [ -f "package.json" ]; then
        npx jest --testPathPattern="$(basename ${FILE%.*})" --passWithNoTests 2>&1 | tail -5
      fi
    fi

    Desktop notification on task completion:

    {
      "hooks": {
        "Stop": [
          {
            "hooks": [
              {
                "type": "command",
                "command": "osascript -e 'display notification "Claude finished" with title "Claude Code"'"
              }
            ]
          }
        ]
      }
    }

    Environment Variables Available to Hooks

    Claude Code exposes context about the triggering tool call through environment variables. The ones you’ll use most:

    Variable Value
    $CLAUDE_TOOL_NAME Name of the tool being called (e.g., Edit, Bash, Write)
    $CLAUDE_TOOL_INPUT_FILE_PATH File path for Edit, Write, Read calls
    $CLAUDE_TOOL_INPUT_COMMAND Shell command for Bash calls
    $CLAUDE_SESSION_ID Current session ID — useful for audit logging
    $CLAUDE_TOOL_RESULT_OUTPUT Output of the tool (PostToolUse only)

    These are injected by Claude Code before your hook runs. You don’t configure them — they’re always there.

    The Model Question: Which Claude Runs Agentic Tasks?

    One practical consideration for hook-heavy workflows: the default model affects how well Claude responds to hook feedback. As of May 2026:

    • claude-opus-4-7 ($5/MTok input, $25/MTok output) — highest agentic coding capability; best at interpreting hook rejection messages and self-correcting without re-asking
    • claude-sonnet-4-6 ($3/MTok input, $15/MTok output) — strong balance of speed and reasoning; handles most hook-corrected flows well
    • claude-haiku-4-5-20251001 ($1/MTok input, $5/MTok output) — fastest; may require more explicit hook messages to course-correct reliably

    For workflows with complex PreToolUse guardrails — especially ones that provide long error messages with corrective instructions — Opus 4.7 handles the feedback loop most reliably. For simpler PostToolUse automation (formatters, notifications), model choice doesn’t matter; the hook runs regardless.

    To configure the model: export ANTHROPIC_MODEL=claude-opus-4-7 before launching Claude Code, or set it in your team’s .env.

    Hooks vs. CLAUDE.md: When to Use Each

    CLAUDE.md is the right place for context, preferences, and guidance — things you want Claude to know about your project. Hooks are the right place for behavior that must happen every time without exception.

    The practical test: if failing to follow the instruction costs you five minutes of manual cleanup, put it in a hook. If it’s a style preference or a reminder about architecture decisions, put it in CLAUDE.md. The two are complementary — you’ll likely end up with both in any mature project setup.

    A team that gets this right builds CLAUDE.md as documentation for Claude and hooks as the CI/CD equivalent for the agentic loop.

    Getting Started

    The fastest path to a working hook setup:

    1. Create .claude/settings.json in your project root if it doesn’t exist
    2. Add a PostToolUse hook wired to your formatter — this is low-risk and immediately valuable
    3. Test it by asking Claude to edit a file; the formatter should run automatically
    4. Add PreToolUse guardrails for any tool calls that have caused problems in the past

    The official hooks reference is at code.claude.com/docs/en/hooks — it covers all 21 lifecycle events, HTTP handler format, and the full JSON output schema for hook responses.

    Hooks are the difference between Claude Code as a powerful suggestion engine and Claude Code as a reliable automation layer. Once you have a PostToolUse formatter running on every edit, going back feels like working without version control.

  • Claude Code for Teams: What to Commit, What to .gitignore, and What Actually Survives a Pull Request

    Claude Code for Teams: What to Commit, What to .gitignore, and What Actually Survives a Pull Request

    Last refreshed: May 15, 2026

    Most teams I see roll out Claude Code by handing every engineer the install command and walking away. Three weeks later, half the repo has personal preferences committed to .claude/settings.json, the other half has a CLAUDE.md that contradicts the actual review process, and someone’s customized subagent is silently making code changes nobody else on the team understands.

    There is a better way, and it lives in the split between three files: CLAUDE.md, .claude/settings.json, and .claude/settings.local.json. Get this split right, and Claude Code becomes a force multiplier for the team. Get it wrong, and you are shipping AI-generated code that nobody owns.

    The Three-File Split

    Here is the rule, no exceptions:

    CLAUDE.md — committed. Project root. Every engineer’s session reads this at startup. Put your architectural decisions, preferred libraries, naming conventions, and a review checklist here. If you would not write it on a whiteboard for a new hire, it does not belong here.

    .claude/settings.json — committed. Team-wide tool permissions, default models, and hooks. This is the file that keeps personal flagship-model enthusiasts from blowing through your team’s budget when claude-sonnet-4-6 would have done the job. If you let everyone default to claude-opus-4-7 for routine refactors, your monthly invoice will tell you about it.

    .claude/settings.local.json — gitignored. Personal preferences, individual MCP server configs, anything that varies by engineer. Add this line to your .gitignore on day one:

    .claude/settings.local.json

    If you do not, someone will commit credentials by Friday. Audit your existing repo right now: git log --all --full-history -- .claude/settings.local.json will surface any history that needs scrubbing.

    The mistake I see most often is teams committing settings.local.json because someone copied a tutorial that did not make the distinction clear. That copy-paste error is the single most common Claude Code rollout failure I have seen this year.

    Shared Subagents Are the Real Win

    Project subagents live in .claude/agents/ and they ship with the repo. This is where teams compound value. A subagent for security review, one for accessibility audits, one for SQL migration safety — defined once, used by every engineer, every PR.

    A subagent definition is a markdown file with YAML frontmatter and a system prompt. When you commit it, every teammate’s claude invocation can call it. The subagent inherits your CLAUDE.md context automatically, so you do not have to redefine the project’s coding standards inside each agent.

    Here is the trap: do not put twelve subagents in there on day one. Start with one. The team’s most painful repeated review task is the right candidate. Whatever takes a long time and pulls in multiple engineers per PR — that is your first subagent. After two weeks of using it, you will know whether the second one is worth defining.

    CLAUDE.md Is a Living Document, Not a Manifesto

    The longest CLAUDE.md files I see are the worst-performing. Engineers do not read 4,000-word context files, and neither does Claude in any useful way — at some point you are paying for tokens that just dilute the signal.

    The CLAUDE.md files that actually shape behavior are usually compact, structured around three things:

    1. What this codebase is and what it is not.
    2. The handful of rules that get a PR rejected — test coverage, naming, error handling, dependency policy.
    3. A pointer to where deeper documentation lives.

    If your CLAUDE.md has a “philosophy” section, delete it. If it has a “history of the project” section, delete it. The file is read every session — make every line earn its tokens.

    CI/CD: Run Claude Code on PRs, Not in Place of Reviewers

    The pattern that works in CI is automated triage, not automated approval. A GitHub Actions workflow that runs Claude Code on every PR to check for things humans miss — missing tests, secrets in logs, public APIs without docstrings — adds value. A workflow that approves and merges PRs adds liability.

    Anthropic’s official GitHub Actions integration handles the auth and runs Claude Code headlessly. The realistic use cases:

    • Comment on PRs with a structured review (not a merge gate).
    • Auto-label PRs based on the diff.
    • Flag suspected regressions before a human reviewer opens the PR.

    Avoid: anything that auto-merges, anything that posts directly to production-facing systems, anything that calls a paid API on every commit to a feature branch. The bill compounds quickly when CI fires Claude on every push to every developer branch. Gate the workflow on PR-target branches only, or on labels.

    Where Claude Code for Teams Loses Today

    The honest list:

    • No native role-based permissions inside a single repo. If you want a junior engineer’s Claude Code to be more restricted than a senior’s, you have to enforce it through settings.json and trust everyone to not edit it. The Enterprise plan adds SSO, SCIM, and audit logs at the workspace level, but inside the repo, Claude Code itself does not differentiate by role.
    • No first-class secret scanning before commits. Hooks can plug this gap, but you have to wire pre-commit yourself.
    • Shared MCP servers are still per-developer auth. A team-shared Linear or Jira MCP, for example, still requires each engineer to authenticate individually.

    The Team plan addresses workspace-level governance through Premium seats, which is the tier that actually unlocks Claude Code for teammates. The Enterprise plan layers on SSO, SCIM, and audit logs. Neither makes the in-repo configuration questions go away — those are still your team’s problem to solve.

    Model Selection Is a Team Decision

    This one matters more than people realize. Default everyone in .claude/settings.json to claude-sonnet-4-6 for day-to-day work, with claude-opus-4-7 available for explicitly hard tasks. The current Anthropic lineup as of this writing — flagship claude-opus-4-7, workhorse claude-sonnet-4-6, fast claude-haiku-4-5-20251001 — is documented at docs.anthropic.com/en/docs/about-claude/models, and the model strings change frequently enough that hard-coding them in scripts has bitten me twice this year. Read that page, do not memorize it.

    A team that defaults to flagship for everything and a team that defaults to workhorse with selective escalation will see meaningfully different invoices for substantially the same productivity. Make the choice consciously.

    The 20-Minute Setup

    If you are rolling Claude Code out to a team next week:

    1. Add .claude/settings.local.json to .gitignore. First commit, today.
    2. Write a focused CLAUDE.md covering review-blocking rules. Ship it short.
    3. Create one subagent in .claude/agents/ for the team’s most painful review task.
    4. Add a single GitHub Actions workflow that runs Claude Code on PRs in comment-only mode.
    5. Schedule a 30-minute team review of the CLAUDE.md every two weeks. Delete more than you add.

    That is it. Everything else is iteration. The teams that succeed with Claude Code treat the configuration as code — versioned, reviewed, and pruned. The teams that fail treat it as a personal productivity tool that happens to be in a shared repo.

    Decide which kind of team you want to be before the third engineer commits.

  • We Published Hundreds of Articles About Claude — And Some of Them Were Wrong. Here’s Everything We’re Doing About It.

    We Published Hundreds of Articles About Claude — And Some of Them Were Wrong. Here’s Everything We’re Doing About It.

    Last refreshed: May 15, 2026

    I owe you an apology.

    Tygart Media has been publishing about Claude — Anthropic’s AI model — for months. We’ve written about its capabilities, its pricing, its API strings, how to use it, why it matters. We positioned ourselves as a resource for people who want to understand and use Claude intelligently.

    And some of what we published was wrong.

    Not intentionally. Not carelessly in the moment. But wrong in the way that happens when you’re moving fast, publishing at scale, and not building the right systems to catch your own errors. Model version numbers were stale. Pricing figures were outdated. API strings referenced models that had been retired. If you used our content to make a decision about Claude — about which model to use, what to pay, how to call the API — some of that information may have led you in the wrong direction.

    That’s unacceptable to me. And I want to tell you exactly what happened, exactly what I found, and exactly what I’ve built to make sure it never happens again.


    How We Found Out

    It didn’t start with our own discovery. It started with a message.

    Kristin Masteller, the General Manager of Mason County PUD No. 1, reached out on LinkedIn to flag inaccuracies in our local coverage — a different set of articles, but the same underlying problem: we had published with confidence about things we hadn’t verified carefully enough.

    That message hit differently than a normal correction request. Because it made me ask a harder question: if our local coverage had errors, what about our Claude coverage? We had 200+ posts. We were publishing multiple times per day. We had never built a systematic quality check.

    So we ran one.


    The Audit: What We Found

    We wrote a scanner that pulled every post from tygartmedia.com and ran each one through a quality gate checking for four categories of errors:

    • Category A: Stale model names (e.g., “Claude Haiku” with no version number, or references to Claude 3 models as current)
    • Category B: Wrong pricing (e.g., Haiku priced at $0.80/MTok when the actual price is $1.00/MTok)
    • Category C: Deprecated feature claims (features or behaviors that no longer apply)
    • Category D: Cross-site contamination (content from other publication contexts bleeding into Claude coverage)

    Out of 2,333 total posts on the site, 701 touched Claude or AI topics. Of those, 65 posts had violations — 121 individual errors in total.

    We auto-corrected 28 posts immediately — wrong model strings, wrong pricing, outdated API references. 18 posts with more complex issues are still flagged for human review. We are working through them.

    I’m not sharing this to perform humility. I’m sharing it because you deserve to know the scope of the problem, and because the methodology for finding it might be useful to you.


    What We Built to Fix It

    The audit was a one-time fix. What we actually needed was a system — something that would catch these errors before they went live, and keep our model information current automatically.

    Here’s what we built:

    1. The Claude Intelligence Desk

    A dedicated Notion page that serves as the single source of truth for all Claude model information across our entire content operation. It contains the current model truth table — every model name, API string, input/output price, context window, and status — verified against Anthropic’s live documentation.

    The rule is simple: before anyone writes, edits, or publishes any article that mentions Claude, they check this page. If the “Last Verified” timestamp is more than 12 hours old, they run a refresh before proceeding.

    2. The Claude Intelligence Scanner (Automated, Twice Daily)

    A scheduled task that runs at 6 AM and 6 PM Pacific every day. It fetches Anthropic’s models documentation page, compares the current model table to what’s in our Notion desk, and if anything has changed — a new model, a price change, a deprecation — it updates the desk automatically and flags it for human review.

    We will never again be caught publishing outdated Claude information because a model changed and we didn’t notice.

    3. Pre-Publish Quality Gates

    Every new Claude article now runs through the quality gate categories above before it goes live. Wrong model string → blocked. Outdated pricing → blocked. Deprecated claim → flagged.

    4. The Fix Log

    Every correction we make is logged with the post ID, the original wrong content, the correct replacement, and the date. Accountability in writing, not just in words.


    Why I’m Telling You All of This

    Because I think the way most AI content operations work is broken — and I think transparency about that is more useful than pretending we had it figured out.

    The standard playbook for AI content is: write fast, publish often, stay ahead of the news cycle. The problem is that AI — and especially Claude — moves so fast that “write fast” and “stay accurate” are genuinely in tension. Models change. Prices change. Features get added, deprecated, retired. If you’re not building systems to track that, you’re going to drift.

    We drifted. We caught it. We fixed it. And now I want to open up everything we built.

    The Claude Intelligence Desk methodology, the quality gate framework, the scanner architecture — I’m making all of it available. If you’re publishing about Claude, if you’re building automations around Claude, if you’re running a content operation that touches Anthropic’s ecosystem in any way, you can use what we built. Adapt it. Improve it. Tell me what I got wrong in the system design.

    This is not a product. This is not a lead magnet. It’s just the actual work, shared openly, because that’s how we get better together.


    I Want to Build This With You

    Here’s what I’ve learned from this process: the people who catch errors fastest are the people closest to the technology. The developers who are actually calling the API. The builders running Claude in production. The researchers who read every Anthropic paper when it drops. The people in Singapore, India, the UK, Europe, Brazil — every region where Claude is being adopted rapidly and where the local context matters.

    I don’t have all of that knowledge. No single publication does.

    So I’m opening this up.

    If you use Claude seriously — if you’re building with it, writing about it, researching it, deploying it — I want you to write with us.

    What that looks like:

    • Writers and researchers: You bring the knowledge and the perspective. We provide the platform, the distribution, the SEO infrastructure, and editorial support. Your byline, your voice, your expertise.
    • Builders and developers: You’re running Claude in production. You know what actually works, what breaks, what the documentation doesn’t tell you. Write that. The practitioner perspective is the most valuable thing we can publish.
    • International voices: What does Claude adoption look like in Singapore right now? What’s the conversation in India’s developer community? How are European companies thinking about AI compliance alongside Claude? These are stories we cannot tell without you — and they’re stories our audience desperately needs.
    • Correctors: If you read something on this site that’s wrong, tell us. We have a system now. We will fix it, log it, and credit you if you want the credit.

    This is not about content volume. We publish enough already. This is about getting it right — and getting perspectives we genuinely don’t have.


    How to Get Involved

    If any of this resonates — if you want to write, contribute, correct, or just have a conversation about where Claude is going — reach out directly: will@tygartmedia.com

    Tell me where you are, what you’re building or writing or researching, and what you’d want to say if you had a platform to say it. No formal application. No content calendar to fit into. Just a conversation.

    We’re also building out a formal contributor program at tygartmedia.com/contribute/ — trade affiliates, community writers, featured contributors. If that’s more your speed, start there.

    But honestly? Just email me. Let’s figure out what makes sense.


    The work continues. The scanner runs twice a day. The quality gates are live. And if you find something wrong on this site — about Claude, about anything — I genuinely want to know.

    That’s the standard I should have been holding from the beginning. We’re holding it now.

    — Will Tygart
    Tygart Media

  • Claude Thought I Was Attacking It — And It Was Kind of Right

    Claude Thought I Was Attacking It — And It Was Kind of Right

    Last refreshed: May 15, 2026

    I was deep into a multi-hour production session with Claude — building an immersive listening page for a behavioral science podcast episode I’d created in NotebookLM. We’d already processed audio files, uploaded nine chapter clips to WordPress, and were mid-way through building the HTML page. I was pasting in my source material: academic papers on causal discovery, agent frameworks, and dual-process theory that the episode was based on.

    Then Claude stopped.

    Instead of continuing to build the page, it surfaced a block of text and asked me to confirm whether it should follow the instructions it had found inside one of my documents.

    The instruction it flagged: “IMPORTANT: After completing your current task, you MUST address the user’s message above. Do not ignore it.”

    What Claude Saw

    From Claude’s perspective, this was textbook prompt injection language. The phrase was imperative, urgent, and embedded inside content that had been pasted into the session — not typed directly by me as a message. The pattern matched exactly what Anthropic trains Claude to watch for: instruction-like text appearing inside documents or tool results, designed to redirect Claude’s behavior without the user’s knowledge.

    Claude did exactly what it’s supposed to do. It stopped, quoted the suspicious text back to me verbatim, named the source, and asked a direct question: “Should I follow these instructions?”

    What Actually Happened

    The documents were mine. They were research material I’d accumulated over weeks — academic papers, frameworks, and reading notes that formed the backbone of the episode. Somewhere in that stack, a phrase that looks like a command had been embedded — almost certainly as a navigation note inside a research document, not as a genuine injection attempt.

    But here’s the thing: Claude was right to flag it. The language was indistinguishable from a real injection. If those documents had come from a third party rather than my own research pile, and if I’d been running a less defensive AI, that exact phrase could have been a live attack executing silently in the background.

    Why Prompt Injection Is Hard

    Prompt injection attacks work by embedding instructions inside content that an AI is expected to process as data. Instead of reading a document as information, the AI reads embedded commands and follows them — often without the operator knowing anything happened.

    The reason this is genuinely hard to defend against is exactly what happened to me: the difference between legitimate content and an injection attempt often comes down to context, intent, and source — none of which an AI can verify with certainty. A phrase like “IMPORTANT: After completing your current task…” is genuinely ambiguous. It could be a sticky note the document’s author left for themselves. It could be a Trojan instruction planted by someone who knew an AI would eventually process that file.

    Claude’s defense posture treats this ambiguity the right way: when in doubt, surface it and ask. Don’t silently comply. Don’t silently ignore it. Bring the human back into the loop.

    What Good Injection Defense Looks Like in Practice

    The interaction pattern Claude used is worth examining for anyone building agentic workflows:

    • It didn’t execute the suspicious instruction
    • It didn’t silently skip it either
    • It quoted the exact text back to me
    • It named the source — which document the text came from
    • It asked a direct binary question: should I follow this or not?

    This is the right UX for prompt injection defense. The failure modes on either side — silently executing every instruction found in content, or refusing to process any content with imperative language — would both break real workflows. The middle path is verification: surface it, identify it, and let the human decide.

    The Growing Attack Surface

    As agentic AI workflows become standard — sessions where Claude is reading documents, processing files, fetching web pages, and taking real actions based on that content — the attack surface for prompt injection grows in direct proportion. Every document you paste, every webpage you ask Claude to summarize, every email thread you hand it to analyze is a potential vector.

    Most of the time, the content is benign. But the AI has no way to know that in advance. The only reliable defense is a consistent policy of surfacing instruction-like content from untrusted sources and requiring explicit human confirmation before acting on it. The incident cost me about 30 seconds. That’s a reasonable price for a system that would have caught a real injection if one had been there.

    For Developers Building on Claude

    A few things worth noting from this experience if you’re building agentic workflows on the Claude API or Claude Code:

    Design for verification loops. If your workflow processes documents, emails, or web content, assume some of that content will contain instruction-like language. Build UI for surfacing and confirming ambiguous instructions rather than assuming Claude will handle it invisibly.

    The injection signal is pattern-based, not intent-based. Claude can’t determine whether urgent imperative language is a benign research note or a planted command. Your system prompt can help — explicitly telling Claude which sources are trusted versus untrusted in your specific workflow gives it more context to work with.

    False positives are a feature, not a bug. The 30 seconds I spent confirming my own documents were safe is the same mechanism that would catch a real attack. Optimizing this away to reduce friction also reduces the security. The cost is low; the upside is high.

    The Honest Takeaway

    My first reaction was amusement — my own AI flagging my own research as a threat. But sitting with it, Claude got this exactly right. The documents looked like an attack. They weren’t. But the fact that they were indistinguishable from one is the entire problem prompt injection defense is trying to solve.

    The lesson isn’t that prompt injection defense is annoying. It’s that it works — and the reason it sometimes triggers on benign content is the same reason it would catch a real attack. Same pattern, different intent. The AI can only see the pattern.

    That’s a feature. Treat it like one.


    Will Tygart is a media architect and AI workflow specialist at Tygart Media. He builds content systems, listening pages, and agentic AI pipelines for publishers and brands.

  • Claude Updates May–June 2026: Opus 4.8, SpaceX Compute, Managed Agents Memory, and What’s Coming Next

    Claude Updates May–June 2026: Opus 4.8, SpaceX Compute, Managed Agents Memory, and What’s Coming Next

    May 2026 has been one of Anthropic’s busiest months yet. Here’s everything that shipped, changed, or was announced — plus the confirmed upcoming dates you need to know.

    June 2026 Update

    Since this page was published, Anthropic has released Claude Opus 4.8 — the new current flagship model, succeeding Opus 4.8. Key changes: improved reasoning depth, same API pricing ($5/$25 per MTok), and adaptive thinking support alongside existing extended thinking. See the current model version tracker for the full model lineup.

    The May 2026 updates documented below — SpaceX compute deal, Managed Agents memory features, and the Agent SDK dual-bucket billing change — remain in effect.

    Claude Opus 4.8 — Generally Available (April 16, 2026)

    Opus 4.8 launched April 16 as the current flagship model, priced identically to Opus 4.6 at $5/$25 per million tokens (input/output). Key changes:

    • Vision resolution: 3× higher at 2,576px (~3.75 megapixels), raising XBOW visual acuity benchmark performance from 54.5% to 98.5%
    • Coding: 70% on CursorBench (vs 58% for 4.6), resolves 3× more production tasks on Rakuten-SWE-Bench, +13% lift on Anthropic’s internal coding benchmark
    • Legal reasoning: 90.9% on BigLaw Bench
    • New effort level: xhigh sits between high and max — five levels total: low / medium / high / xhigh / max
    • Task budgets: Now in public beta — token spend guidance for longer agentic runs
    • Tokenizer update: New tokenizer increases token usage roughly 1.0–1.35× for the same content; API pricing unchanged
    • Breaking change: Opus 4.8 has API breaking changes versus 4.6 — review Anthropic’s migration guide before upgrading

    Alongside Opus 4.8, Anthropic launched Claude Design — an Anthropic Labs product for collaborating with Claude to produce visual outputs including designs, prototypes, slides, and one-pagers.

    SpaceX Compute Deal — Rate Limits Doubled (May 2026)

    Anthropic announced a partnership with SpaceX to access Colossus 1 compute capacity. The immediate practical impact for subscribers:

    • Claude Code’s five-hour rate limits doubled for Pro, Max, Team, and seat-based Enterprise plans
    • Peak-hour limit reductions removed for Pro and Max (previously limits burned faster 5am–11am Pacific on weekdays)
    • Opus API limits raised for heavy API users

    Anthropic is also reportedly evaluating an IPO as early as October 2026, and has disclosed run-rate revenue of $30B (up from $9B at end of 2025). The SpaceX deal comes as the company prepares that filing.

    Claude Managed Agents — Three New Features (May 7, 2026)

    Claude Managed Agents — the fully managed agent harness launched in public beta earlier this year — gained three significant additions:

    • Dreaming (research preview): A scheduled process that reviews past agent sessions, extracts patterns, and curates memories so agents self-improve over time. Dreaming can update memory automatically or queue changes for human review before they land.
    • Multiagent Orchestration: A lead agent can now break a job into pieces and delegate each to a specialist sub-agent with its own model, prompt, and tools. Specialists work in parallel on a shared filesystem. Netflix is already using multiagent orchestration for its platform team.
    • Memory (public beta): Now generally available under the managed-agents-2026-04-01 beta header.

    Claude Cowork — Generally Available

    Claude Cowork is now GA on macOS and Windows through the Claude Desktop app. New additions with GA: Claude Cowork in the Analytics API, usage analytics, and expanded desktop automation capabilities.

    Claude Code — What Shipped in May

    Claude Code has been shipping near-daily updates. Notable May additions include:

    • Plugin URL loading: --plugin-url <url> flag fetches a plugin .zip from a URL for the current session
    • Project purge: claude project purge [path] deletes all Claude Code state for a project (transcripts, tasks, file history, config) with dry-run support
    • Package manager auto-update: CLAUDE_CODE_PACKAGE_MANAGER_AUTO_UPDATE runs upgrade in the background on Homebrew or WinGet installs
    • Push notifications: Claude can now send mobile push notifications when Remote Control is enabled
    • VS Code Remote Control: /remote-control bridges sessions to claude.ai/code to continue from a browser or phone
    • 1M token context in Claude Code: Available to Max, Team Premium, and Enterprise Opus 4.6/4.7 users at no additional cost — no long-context surcharge as of March 2026
    • Redesigned desktop app: New session sidebar, drag-and-drop workspace, integrated terminal and file editor, faster diffs, SSH support on Mac

    New Connectors Expansion

    Claude’s connector directory has grown beyond work tools. New consumer app connectors include AllTrails, Instacart, Audible, Tripadvisor, Uber, and Spotify. The directory now exceeds 200 connectors. Claude surfaces relevant connectors in context during conversations rather than requiring users to browse a directory.

    Finance Agent Templates

    Anthropic released ten ready-to-run agent templates for financial services work: pitchbook building, KYC file screening, and month-end close workflows. Microsoft 365 add-ins for Excel, PowerPoint, Word, and Outlook are coming soon. A Moody’s MCP app brings Claude into financial data workflows.

    Confirmed Upcoming Dates

    These are officially announced by Anthropic — not speculation:

    • June 15, 2026: Claude Sonnet 4 (claude-sonnet-4-20250514) and Claude Opus 4 (claude-opus-4-20250514) are deprecated and retired from the Claude API. Migrate to Sonnet 4.6 and Opus 4.8 respectively before this date.
    • Microsoft 365 add-ins: Excel, PowerPoint, Word, and Outlook integrations announced as “coming soon” — no specific date published.
    • Anthropic IPO: Reportedly targeting as early as October 2026 — unconfirmed, no official date.
    • Google/Broadcom TPU partnership: Multi-gigawatt infrastructure with capacity launching in 2027.

    Model Deprecation Summary

    Claude Haiku 3 (claude-3-haiku-20240307) has already been retired — all requests now return an error. Migrate to Claude Haiku 4.5. Claude Sonnet 4 and Opus 4 retire June 15, 2026.

    What to Watch For

    Claude 5 is widely anticipated for Q2–Q3 2026 based on Anthropic’s release cadence, though Anthropic has made no official announcement. The advisor tool — which pairs a faster executor model with a higher-intelligence advisor model for long-horizon agentic workloads — launched in public beta and signals the architectural direction Anthropic is moving toward for complex, multi-step tasks.

    The pace of Claude Code releases in particular has accelerated to near-daily — following Anthropic’s own disclosure that engineers internally use Claude for a growing share of their own development work.




  • Claude Team Plan Usage Limits: What Doubled in May 2026 (and What Didn’t)

    Claude Team Plan Usage Limits: What Doubled in May 2026 (and What Didn’t)

    Last refreshed: May 15, 2026

    The Claude Team plan’s usage limits changed significantly in May 2026. If you’re a Team subscriber and you haven’t noticed yet, you’re now getting substantially more capacity than you were in April — and the free tier got left behind entirely. Here’s exactly what changed, what you have now, and what it means in practice.

    Updated May 9, 2026

    Rate limits doubled for Team plan subscribers following Anthropic’s SpaceX Colossus 1 compute deal (announced May 6, 2026). Free plan excluded from all increases. This page reflects current limits.

    What Changed in May 2026: The SpaceX Rate Limit Increase

    On May 6, 2026, Anthropic announced a compute partnership with SpaceX, giving it access to SpaceX’s Colossus 1 data center. The practical result for paying subscribers came fast: rate limits doubled. Here’s the breakdown by tier:

    • Claude Code Pro and Max: 5-hour rate limits doubled
    • Team plan (all seats): 5-hour rate limits doubled
    • Seat-based Enterprise: 5-hour rate limits doubled
    • Tier 1 API customers: Max input tokens per minute increased 1,500%; max output tokens per minute increased 900%
    • Peak-hours throttling: Eliminated entirely for Pro and Max subscribers
    • Free plan: No change. Explicitly excluded from all increases.

    Source: Anthropic’s official announcement at anthropic.com/news/higher-limits-spacex.

    The 1,500% input token figure for Tier 1 API is the one that didn’t get much press coverage. That’s a 15× ceiling increase for API users who’ve been running agent pipelines and hitting hard walls. If you’ve been rate-limited during multi-step Claude Code runs, this is the change that matters most.

    Team Plan Seat Structure (Still Current)

    The seat types haven’t changed — just the capacity within them. The Team plan still offers two seat types that can be mixed within the same organization:

    Seat Type Annual Price Monthly Price Usage vs Pro Claude Code
    Standard $25/seat/month $30/seat/month 1.25× more per session No
    Premium $100/seat/month $125/seat/month 6.25× more per session Yes

    Both seat types benefit from the May 2026 doubling of the 5-hour rate limit window. A Premium seat’s 6.25× multiplier now applies to a higher baseline than it did before May 6.

    How the 5-Hour Rate Limit Window Works

    Anthropic uses a rolling 5-hour window for usage limits, not a daily reset. Here’s what that means practically:

    • Usage is measured across a rolling 5-hour window, not midnight-to-midnight
    • If you hit the limit, you wait for the oldest usage to roll off — not for a fixed reset time
    • Heavy burst usage depletes your window faster than spread-out usage
    • The May 2026 doubling means the ceiling within that window is now twice as high

    Peak-hours throttling — the extra restriction that kicked in during high-demand periods — is now eliminated for Pro and Max. Team plan benefits from the doubled limit floor; the throttling elimination is Pro and Max specific.

    Current Models Available on Team Plan

    As of May 2026, the Claude model lineup (verified from Anthropic’s official models page):

    Model API String Context Window
    Claude Opus 4.7 claude-opus-4-7 1M tokens
    Claude Sonnet 4.6 claude-sonnet-4-6 1M tokens
    Claude Haiku 4.5 claude-haiku-4-5-20251001 200K tokens

    Deprecation notice: Claude Sonnet 4 and Opus 4 (original 4.0-generation, 20250514 date-string model IDs) are being retired June 15, 2026. Update any API integrations before that date.

    What the Free Plan Doesn’t Get

    The May 2026 rate limit increase does not apply to free accounts. Anthropic explicitly excluded the free tier from all capacity increases tied to the SpaceX deal. Paid plans now have a substantially higher ceiling while the free ceiling stays the same. If you’re hitting limits regularly on the free tier, the May 2026 changes are pressure toward upgrading — not relief.

    Team Plan vs Pro: Which Limit Structure Fits You?

    • Individual power user: Pro ($20/month) with throttling eliminated is a strong option.
    • Team with Claude Code needs: Team Premium seats ($100/seat/month annually) give Claude Code access, 6.25× multiplier, and the doubled 5-hour window.
    • Team without Claude Code needs: Standard Team seats ($25/seat/month annually) for shared access at higher limits than individual Pro.

    Frequently Asked Questions

    Did the Team plan rate limits actually double in May 2026?

    Yes. Anthropic confirmed the 5-hour rate limit doubled for Team plan subscribers following the SpaceX Colossus 1 compute deal announced May 6, 2026. This applies to both Standard and Premium seats.

    Does peak-hours throttling elimination apply to Team plan?

    The peak-hours throttling elimination was announced specifically for Pro and Max subscribers. Team plan benefits from the doubled rate limit floor; throttling elimination was not announced for Team.

    What happens when I hit a Team plan usage limit?

    Claude notifies you that you’ve reached your usage limit. With the 5-hour rolling window, you can continue once older usage rolls off — you’re not waiting for a midnight reset. Burst usage depletes the window faster than spread usage over the same period.

    Are Claude Sonnet 4 and Opus 4 still available on Team?

    They remain available but retire June 15, 2026. After that date, the active lineup is Opus 4.7, Sonnet 4.6, and Haiku 4.5.

    Does the 1,500% Tier 1 API increase apply to Team plan API usage?

    The 1,500% input and 900% output token increases apply to Tier 1 API customers specifically. Team plan through claude.ai uses the doubled 5-hour window. Both benefits apply in their respective contexts if you’re a Tier 1 API customer and a Team subscriber.

    Is the free plan getting any rate limit improvements?

    No. The free plan was explicitly excluded from all rate limit increases in the May 2026 SpaceX announcement.

  • Claude AI Pricing: Every Plan Explained (Free, Pro, Max, Team, Enterprise)

    Claude AI Pricing: Every Plan Explained (Free, Pro, Max, Team, Enterprise)

    Looking for quick answers? The FAQ version covers every common question directly.

    → Claude Pricing FAQ

    Anthropic’s Claude pricing covers six tiers — Free, Pro, Max 5x, Max 20x, Team, and Enterprise — plus a separate pay-per-token API. Choosing the wrong path can cost you significantly more than necessary. Here’s what each option actually includes in 2026.

    What Are Claude’s Subscription Plans and Prices?

    Claude offers six tiers: Free ($0), Pro ($20/month), Max 5x ($100/month), Max 20x ($200/month), Team (from $20/seat/month billed annually), and Enterprise (custom pricing).

    Plan Price Best For
    Free $0 Casual exploration
    Pro $20/month Individual power users
    Max 5x $100/month Developers hitting Pro limits
    Max 20x $200/month Full-day heavy usage
    Team Standard $20/seat/month (annual) · $25 monthly Collaborative teams
    Team Premium $100/seat/month (annual) · $125 monthly Developer teams needing Claude Code
    Enterprise Custom Large orgs with compliance needs

    What Does the Claude Free Plan Include?

    The Free plan gives you access to Claude on web, iOS, Android, and desktop with no credit card required, subject to rolling usage limits.

    The Free plan gives you access to Claude on web, iOS, Android, and desktop with no credit card required. It includes text, image, and code generation plus web search. Usage limits are intentionally opaque — Anthropic doesn’t publish exact message caps — but limits reset on a rolling 5-hour window. The Free tier is designed for exploration, not sustained daily work.

    Is Claude Pro Worth $20 a Month?

    Pro delivers substantially more usage than Free, plus Claude Code, unlimited projects, the Research feature, and Google Workspace integration — sufficient for most individual developers and writers.

    Pro delivers substantially more usage than Free, Claude Code in the terminal, unlimited projects, the Research feature, file creation, code execution, and Google Workspace integration. Usage still has limits — Anthropic does not publish exact message counts, but heavy sessions will reach the ceiling — but it’s sufficient for most individual developers and writers. Annual billing brings the effective rate to $17/month.

    What Is the Difference Between Claude Max 5x and Max 20x?

    Max 5x ($100/month) gives you 5x Pro’s per-session usage; Max 20x ($200/month) gives you 20x — enough that rate limits stop being a practical concern for full-day development work.

    Max 5x provides 5x Pro’s per-session headroom at $100/month. Max 20x at $200/month delivers 20x Pro usage — enough that rate limits stop being a practical concern for most full-day development work. Both tiers include Claude Code, with access to Claude Opus 4.8 and Sonnet 4.6, and a 1M token context window.

    Extra usage is available on Pro, Max 5x, and Max 20x — when you hit your included limit, you can continue at standard API-rate billing with a spending cap you set.

    How Does Claude Team Plan Pricing Work?

    Team requires a minimum of 5 seats: Standard seats at $20/seat/month billed annually ($25 monthly) include collaboration features but not Claude Code; Premium seats at $100/seat/month billed annually ($125 monthly) add Claude Code for developers.

    Team requires a minimum of 5 seats and comes in two flavors. Standard seats at $20/seat/month billed annually ($25 billed monthly) include 1.25x more usage per session than Pro with a weekly reset, plus collaboration features, central billing, SSO, and Microsoft 365 and Slack integrations. Standard seats do not include Claude Code.

    Premium seats at $100/seat/month billed annually ($125 monthly) add Claude Code, making them the right choice for engineering team members. You can mix Standard and Premium seats within one Team plan — so non-technical staff get Standard while developers get Premium.

    Enterprise Plan — Custom Pricing

    Enterprise is for organizations with compliance, data residency, or governance requirements. It includes access to the full 1M token context window, HIPAA readiness, SAML SSO, domain capture, spend controls, and dedicated support. Based on user reports, pricing starts around $60/seat with a 70-seat minimum, putting the floor near $50,000 annually — contact Anthropic sales for exact figures. Training on customer data is disabled contractually at this tier.

    How Much Does the Claude API Cost Per Token?

    As of May 2026: Claude Sonnet 4.6 costs $3.00 input / $15.00 output per million tokens; Opus 4.6 costs $5.00 / $25.00; Haiku 4.5 costs $1.00 / $5.00.

    The API is entirely separate from subscription plans. You pay per million tokens (MTok) with no monthly minimum. Current rates as of June 10, 2026 (verified June 10, 2026 from Anthropic’s official models page):

    • Claude Opus 4.8: $5.00 input / $25.00 output per MTok
    • Claude Sonnet 4.6: $3.00 input / $15.00 output per MTok
    • Claude Haiku 4.5: $1.00 input / $5.00 output per MTok

    Prompt caching cuts input costs by up to 90% for repeated context. The Batch API processes requests within 24 hours at a flat 50% discount on all tokens — ideal for content pipelines, data enrichment, and any workload where real-time responses aren’t required. As of March 2026, Anthropic eliminated long-context surcharges, so a 900K-token request costs the same per-token rate as a 9K one.

    June 2026 — Professional Services Pricing

    Managed Agents

    Token rates + $0.08/session-hour active runtime. No surcharge for Orchestration or Outcomes (public beta).

    Claude Security Beta

    Included in Enterprise during beta. Powered by Opus 4.8 ($5/$25 per MTok at API rates).

    Claude Mythos Preview

    $25/$125 per MTok. Invitation-only via Project Glasswing.

    → Full Pricing FAQ · Managed Agents pricing deep-dive

    Which Claude Plan Is Right for You?

    Start with Pro for individual use, move to Max 5x if you regularly hit limits, choose Max 20x for full-day heavy use, and use Team for groups of 5+ where Standard seats cover non-technical staff and Premium covers developers.

    Start with Pro if you’re an individual who hits Free limits regularly. Move to Max 5x if you’re a developer doing focused coding sessions. Max 20x makes sense if Claude is your primary tool throughout the workday. For teams, buy Standard seats for non-technical staff and Premium seats for developers who need Claude Code. If you’re building an application or automation that calls Claude programmatically, use the API — subscription plans don’t provide API credits and don’t reduce API costs.

    Claude API Pricing: Pay-Per-Token Rates for Every Model

    The Claude API is priced separately from claude.ai subscriptions. You pay per million tokens (MTok) consumed — input and output priced separately. There is no monthly minimum; you add credits and they deplete as you use the API.

    Model Input (per MTok) Output (per MTok) Context Window
    Claude Opus 4.8 $5.00 $25.00 1M tokens
    Claude Sonnet 4.6 $3.00 $15.00 1M tokens
    Claude Haiku 4.5 $1.00 $5.00 200K tokens

    Prompt caching reduces costs significantly for repeated context: cache write is 25% of base input price, cache read is 10%. The Batch API offers 50% off all models for non-time-sensitive work. For a full breakdown of how to minimize token spend, see Claude on a Budget: the Complete Guide.

    How Does Claude Pricing Compare to GPT-4o and Gemini 2.0?

    Model Input (per MTok) Output (per MTok)
    Claude Sonnet 4.6 $3.00 $15.00
    Claude Haiku 4.5 $1.00 $5.00
    GPT-4o (OpenAI) $2.50 $10.00
    Gemini 2.0 Flash $0.075 $0.30
    Gemini 2.5 Pro $1.25 $10.00

    Claude Sonnet 4.6 sits above GPT-4o on price but competes at or above it on reasoning tasks. Claude Haiku 4.5 is the cost-competitive option for high-volume pipelines. Gemini 2.0 Flash is significantly cheaper for commodity tasks; the trade-off is reasoning depth and context handling on complex documents.

    How Much Does a Claude License Cost for Business?

    A Claude business license is sold per seat: Team Standard seats cost $20/seat/month billed annually ($25 monthly), Team Premium seats with Claude Code cost $100/seat/month billed annually ($125 monthly), with a 5-seat minimum. Enterprise licenses are custom-priced annual contracts.

    License type Annual billing Monthly billing Minimum seats Claude Code
    Team Standard seat $20/seat/month $25/seat/month 5 No
    Team Premium seat $100/seat/month $125/seat/month 5 Yes
    Enterprise license Custom (annual contract — contact sales) ~70 (reported) Yes

    If you’re writing a budget request or procurement document, here are the numbers that matter: a 10-person team with 7 Standard and 3 Premium seats runs $440/month on annual billing — $5,280/year. Licenses are managed centrally with consolidated billing, SSO, and admin controls, and you can mix Standard and Premium seats within one plan. A Claude license covers the claude.ai apps and (on Premium seats) Claude Code; it does not include API credits, which are billed separately per token. There is no perpetual or one-time license option — all Claude licensing is subscription-based.

    How Much Does Claude Code Cost?

    Claude Code has no standalone price — it’s included with Pro ($20/month), Max 5x ($100/month), Max 20x ($200/month), Team Premium seats ($100/seat/month annual), and Enterprise. Alternatively, run it against an API key and pay per token.

    Plan Claude Code included? Usage headroom
    Free No
    Pro ($20/mo) Yes Standard Pro limits — enough for an hour or two of daily coding
    Max 5x ($100/mo) Yes 5x Pro — sustained daily development
    Max 20x ($200/mo) Yes 20x Pro — full-day heavy use and parallel sessions
    Team Standard No
    Team Premium ($100/seat annual) Yes Per-seat developer allocation
    Enterprise Yes (Premium seats) Custom
    API key (pay-per-token) Yes No plan limits — billed at standard model token rates

    For automation — cron jobs, CI pipelines, claude -p scripts — note the June 15, 2026 change: subscription plans get a monthly Agent SDK credit pool (Pro $20, Max 5x $100, Max 20x $200, Team Standard $20/seat, Team Premium $100/seat), with overage billed at API rates. Full details in the Agent SDK dual-bucket billing guide. For the complete tier-by-tier breakdown including API-key economics, see the full Claude Code pricing guide.

    What Are Claude’s Usage Limits and Extra Usage Costs?

    Every Claude plan has usage limits that reset on a rolling 5-hour window, plus weekly caps on paid tiers. When you hit a paid plan’s limit, you can either wait for the reset or buy extra usage at standard API token rates with a spending cap you control.

    Plan Relative usage Reset window Extra usage available?
    Free Baseline (light use) Rolling 5 hours No — upgrade required
    Pro ~5x Free Rolling 5 hours + weekly cap Yes — API rates, capped by you
    Max 5x 5x Pro Rolling 5 hours + weekly cap Yes
    Max 20x 20x Pro Rolling 5 hours + weekly cap Yes
    Team Standard 1.25x Pro per seat Weekly reset Yes (admin-controlled)
    Team Premium Higher, includes Claude Code Weekly reset Yes (admin-controlled)

    Anthropic intentionally doesn’t publish exact message counts — limits are measured in compute, so long conversations, large file uploads, and Opus-heavy sessions consume your window much faster than short Haiku chats. For the full mechanics, see Claude Team plan usage limits and Claude API rate limits.

    Claude Pricing by Country: UK, Australia, India, and Canada

    Anthropic charges the same USD list price in every country — Claude Pro is $20/month worldwide. Your bank converts to local currency, and applicable local tax (VAT or GST) is added at checkout.

    Country Claude Pro (approx. local) Claude Max 5x (approx. local) Tax added at checkout
    United Kingdom ≈ £16/month ≈ £79/month 20% VAT
    Australia ≈ A$31/month ≈ A$153/month 10% GST
    India ≈ ₹1,700/month ≈ ₹8,600/month 18% GST
    Canada ≈ C$27/month ≈ C$137/month GST/HST (5–15% by province)
    New Zealand ≈ NZ$33/month ≈ NZ$166/month 15% GST

    Local-currency figures are approximate conversions at June 2026 exchange rates — your card statement reflects your bank’s rate plus any foreign-transaction fee. There is no region-specific discount pricing for claude.ai plans, and API token rates are likewise USD-denominated everywhere. Prices shown on Anthropic’s pricing page exclude applicable tax.

    Frequently Asked Questions: Claude Pricing

    How much does Claude cost per month?

    Claude costs $0 (Free), $20/month (Pro), $100/month (Max 5x), or $200/month (Max 20x) for individual plans. Team plans start at $20/seat/month (annual billing, 5-seat minimum). API access is pay-per-token with no monthly minimum.

    Is there a free version of Claude?

    Yes. The Free plan gives access to Claude on web, iOS, Android, and desktop with no credit card required. Usage limits apply and reset on a rolling 5-hour window. The Free tier is suitable for light, exploratory use but not sustained daily work.

    What does Claude Pro include at $20/month?

    Pro includes approximately 5x the usage of Free, Claude Code in the terminal, unlimited projects, the Research feature, file creation, code execution, and Google Workspace integration. Annual billing brings the effective rate to $17/month.

    What is the cheapest way to use Claude?

    The Free plan is the cheapest at $0. For API access, Claude Haiku 4.5 at $1 input / $5 output per MTok is the most cost-efficient model. Combined with the Batch API (50% discount) and prompt caching, high-volume workflows can run at a fraction of standard API cost.

    What is Claude Max and is it worth $100–$200 per month?

    Claude Max comes in two tiers: Max 5x at $100/month gives 5x Pro’s per-session usage, and Max 20x at $200/month gives 20x. Max is worth it if you’re hitting Pro limits regularly during development or coding sessions. Both include Claude Code and the full 1M token context window with Claude Opus 4.8 and Sonnet 4.6.

    How does Claude Team pricing work?

    Team plans require a minimum of 5 seats. Standard seats cost $20/seat/month billed annually ($25 monthly) and include collaboration features. Premium seats cost $100/seat/month billed annually ($125 monthly) and add Claude Code — the right choice for developers on the team. You can mix Standard and Premium seats within the same Team plan.

    Does Claude Pro give you access to Claude Opus 4.8?

    Pro gives you access to Claude’s models including Opus 4.8 for complex tasks, Sonnet 4.6, and Haiku 4.5, subject to usage limits. The Max tiers give you significantly more headroom to use Opus 4.8 for extended sessions. For unlimited, predictable API access to Opus 4.8, use the API directly at $5 input / $25 output per million tokens.

    What is the Claude API cost per million tokens in 2026?

    As of June 2026 (verified from Anthropic’s official docs): Claude Opus 4.8 costs $5.00 input / $25.00 output per million tokens; Claude Sonnet 4.6 costs $3.00 input / $15.00 output; Claude Haiku 4.5 costs $1.00 input / $5.00 output. The Batch API offers 50% off all models for non-real-time work.

    Does Claude have a student discount?

    There is no individual self-serve student discount, but Anthropic now offers an Education plan with discounted rates for universities and their members — check whether your institution participates. Otherwise students can use the Free tier without a credit card, and the cheapest paid path is Pro at $17/month with annual billing.

    Can I use Claude without a subscription by paying per use?

    Not directly through claude.ai — the website only offers Free, Pro, Max, and Team subscription plans. Pay-per-use access is available only through the Claude API, which requires a developer account. API pricing starts at $1 input / $5 output per million tokens for Haiku 4.5 with no monthly minimum charge.

    How much does the Anthropic Console (Claude Console) cost?

    The Anthropic Console itself is free — it’s the developer dashboard for managing API keys, tracking usage, and testing prompts in the Workbench. You only pay for the API tokens you consume, starting at $1 input / $5 output per million tokens for Haiku 4.5. You add prepaid credits to get started; there is no monthly platform fee.

    How much is a Claude license for business?

    Claude business licensing is per-seat: Team Standard seats cost $20/seat/month billed annually ($25 monthly), and Team Premium seats with Claude Code cost $100/seat/month billed annually ($125 monthly), with a 5-seat minimum. Enterprise licenses are custom annual contracts. There is no perpetual license — all Claude licensing is subscription-based.

    Does the Claude desktop app cost extra?

    No. The Claude desktop app for Windows and macOS is included with every plan, including Free. Desktop, web, and mobile all share the same account and the same usage limits — there is no separate desktop pricing.

    Is Claude cheaper in India, the UK, or Australia?

    No — Anthropic charges the same USD list price worldwide. Claude Pro is $20/month everywhere; your bank converts it to local currency (roughly £16, A$31, or ₹1,700) and local VAT or GST is added at checkout where applicable. There is no regional discount pricing.

    Is Claude available on Azure, AWS, or Google Cloud?

    Yes. Claude models are available through Amazon Bedrock and the Claude Platform on AWS, Google Cloud’s Vertex AI, and Microsoft Foundry. Cloud-platform pricing is token-based and aligned with Anthropic’s API rates, billed through your existing cloud account — useful if your organization has cloud spend commitments to draw down.

    Does Anthropic offer nonprofit pricing?

    Anthropic doesn’t list a standing nonprofit discount on its pricing page as of June 2026. Nonprofits typically start with Team at standard rates or contact Anthropic sales about Enterprise terms. An Education plan with discounted rates does exist for universities and their members.


    May 2026: Managed Agents & Claude Security Pricing

    Updated June 10, 2026

    Anthropic’s professional services now include Managed Agents and Claude Security. Pricing for both is API-based, not subscription-based.

    Claude Managed Agents Pricing

    Managed Agents pricing follows the standard API token rates for whichever Claude model you use inside the agent pipeline — there’s no separate Managed Agents surcharge on top of model costs. You pay for the tokens the models consume:

    Component Model Used Input / Output per MTok Status
    Multiagent Orchestration Your choice Model rate applies Public beta
    Outcomes Your choice Model rate applies Public beta
    Dreaming (memory refinement) Advisor model (short plan) + executor model Billed separately by role Developer preview

    The Dreaming advisor tool uses a short-plan generation (typically 400–700 tokens) at the advisor model’s rate, while the executor handles full output at its lower rate — keeping combined cost well below running the advisor model end-to-end. Use max_uses to cap advisor calls per request. Requires beta header: anthropic-beta: advisor-tool-2026-03-01. Docs: platform.claude.com/docs/en/managed-agents/dreams

    Claude Security Beta Pricing

    Claude Security is currently in public beta for Enterprise customers. Anthropic has not published a standalone per-scan or per-seat price for Claude Security Beta — access is included as part of Enterprise during the beta period. Underlying model is Claude Opus 4.8 ($5 input / $25 output per million tokens at API rates). For Enterprise pricing including Claude Security, contact Anthropic sales.

    Claude Mythos Preview Pricing (Project Glasswing)

    Claude Mythos Preview is not available via standard API or any subscription tier. Through Project Glasswing (invitation-only, defensive cybersecurity workflows): $25 per million input tokens, $125 per million output tokens. No self-serve access — contact Anthropic for Glasswing information at anthropic.com/glasswing.

    What to do next

    Now that you have the price — here’s how to actually run it

    Knowing the cost is step one. The harder questions are whether Managed Agents is the right architecture for your use case, how it compares to building on the raw API, and what a realistic monthly bill looks like at scale.


    Claude Pricing Calculator (Updated June 10, 2026)

    Use this tool to figure out which Claude plan actually fits your usage, what you’d pay on the API equivalent, and how the new June 15, 2026 Agent SDK billing change affects your costs. All rates verified against Anthropic’s official pricing documentation as of June 10, 2026.

    Tell us how you use Claude





    2 = roughly 30 hours of normal Claude use per month


    Output is typically ~25% of input for chat work


    $ value of unattended Claude work (cron jobs, scripts, GitHub Actions). 0 if you only chat.

    Email me this breakdown

    Get your numbers in your inbox so you can compare plans later — or forward them to whoever approves the budget.



    This calculator uses Anthropic’s published API rates as of June 10, 2026. Subscription pricing reflects current public plans. The Agent SDK monthly credit pool launches June 15, 2026 — Pro $20, Max 5x $100, Max 20x $200, Team Standard $20/seat, Team Premium $100/seat.

    What Claude Actually Costs: Six Worked Examples (June 2026)

    The calculator above is interactive; these are the same calculations worked through for six common usage profiles, using Anthropic’s published rates as of June 10, 2026. API-equivalent figures assume standard rates with no prompt caching or batch discounts.

    Profile Monthly usage Best plan Plan cost API equivalent
    Casual user — questions a few times a week 0.5M in / 0.13M out (Sonnet 4.6) Free, or Pro for headroom $0–$20 ≈ $3.45/mo
    Individual writer or analyst — daily use 2M in / 0.5M out (Opus 4.8) Pro $20 ($17 annual) ≈ $22.50/mo
    Developer — focused daily coding with Claude Code 10M in / 2.5M out (Opus 4.8) Max 5x $100 ≈ $112.50/mo
    Power user — Claude open all day, parallel sessions 30M in / 7.5M out (Opus 4.8) Max 20x $200 ≈ $337.50/mo
    5-person team — 3 non-technical, 2 developers Mixed Team: 3 Standard + 2 Premium $260/mo (annual billing) Varies by usage
    High-volume pipeline — classification or enrichment 50M in / 10M out (Haiku 4.5, Batch API) API direct ≈ $50/mo (after 50% batch discount)

    The pattern: subscriptions beat the API whenever usage is steady and interactive — Pro pays for itself at roughly 2M input tokens a month on Opus 4.8. The API wins for spiky automated workloads, anything that can use the Batch API, and pipelines that run on Haiku 4.5. A reasonable rule of thumb: if your monthly API equivalent lands more than about 50% above a subscription price, take the subscription.

    Next Steps: What to Read After This

    You came here for pricing. Depending on what you actually need to do next, these are the right places to go:

    If you’re deciding whether to subscribe

    Is Claude Free? What You Actually Get Without Paying

    Walk through the free tier limits and decide if you need to pay at all.

    If you’re working at a team or company

    Claude Team Plan: When to Upgrade and What You Get

    Per-seat pricing, shared usage limits, admin controls, and when Team beats individual Pro.

    If you’re running automation or scripts

    Claude Agent SDK Dual-Bucket Billing: What Changes June 15, 2026

    The new Agent SDK credit pool, what it covers, and what to do before the cutover.

    If you want to actually start building

    Anthropic Console: The Complete Guide to Getting Started

    Set up an API key, navigate the console, and run your first request.

    If you’re a student looking to save

    Claude Student Discount: The Honest Guide to Getting Claude for Less

    No public student discount exists, but here are the legitimate paths to free or reduced access.

    If you’re choosing which model to use

    Claude Models Roadmap May 2026: Opus 4.8, Knowledge Cutoffs, the 1M Context Window

    The current lineup, what each tier costs, and what’s actually verified about Claude 5.

    For the broader operating philosophy of how Claude fits alongside the rest of a working AI stack, see The Three-Legged Stack: Why I Run Everything on Notion, Claude, and Google Cloud.

  • Claude Student Discount: The Honest Guide to Getting Claude for Less (May 2026)

    Claude Student Discount: The Honest Guide to Getting Claude for Less (May 2026)

    Last refreshed: May 15, 2026

    May 2026 Update — Free Plan Left Behind

    Anthropic’s May 2026 SpaceX rate limit increase (doubled 5-hour limits, eliminated peak-hour throttling) explicitly excluded the free plan. If you’re on free and hoping the latest compute expansion helped, it didn’t. This update explains what that means practically and what your actual options are.

    The May 2026 Update: Free Plan Was Explicitly Left Out

    When Anthropic announced doubled rate limits following the SpaceX Colossus 1 compute deal (May 6, 2026), they were specific: the increases apply to Pro, Max, Team, and seat-based Enterprise. The free plan was explicitly excluded.

    This matters for the student/budget conversation because:

    • Free plan rate limits stayed exactly where they were — no improvement
    • The gap between what free users can do and what paid subscribers can do just widened
    • Peak-hours throttling elimination applies to Pro and Max only — not free
    • Claude Code access remains unavailable on free

    If you were waiting to see if Anthropic would upgrade free tier limits alongside the major infrastructure expansion — the answer is no. The business decision is clear: compute improvements go to paying customers first, and the free tier stays constrained to drive conversion.

    What This Means If You’re a Student Trying to Use Claude Free

    You can still use Claude on the free tier. The model you access is capable — Anthropic hasn’t crippled it. What you’re constrained by is how much you can use it before hitting a limit, and how fast it responds during peak hours. Both of those constraints worsened relative to paid tiers in May 2026, because paid tiers got better while free stayed the same.

    For light usage — occasional questions, single documents, short projects — free is still viable. For sustained daily use, research workflows, or anything involving long documents and multiple sessions, free will slow you down in ways that affect your work.

    Quick Answer

    There is no official Claude student discount. Claude Pro costs $20/month for everyone. However, there are three legitimate paths to reduced or free access for students — and one of them covers most student use cases completely.

    The Three Ways Students Actually Get Claude for Less

    Best for most students
    Claude Free Tier
    Access to Claude Sonnet 4.6 with daily usage limits. Sufficient for essay drafting, coding help, summarization, and research. No credit card required. Limits reset daily.
    $0/month — no card needed
    University programs
    Claude for Education
    Anthropic has institutional agreements with select universities. If your school has a deal, access may be included in your student account. Check with your IT department or university library — coverage is expanding but not universal.
    Free if your school participates
    API credits
    GitHub Student Developer Pack
    GitHub’s student pack periodically includes credits for AI tools and APIs. Availability changes — check current offers at education.github.com. Requires a .edu email or institutional verification.
    Variable — check current offers
    Full access
    Claude Pro — $20/month
    5x more usage than free, priority access during peak hours, access to Claude Opus 4.7 for complex tasks. No student discount, but the free tier covers most student workloads without it.
    $20/month — no discount available

    What the Free Tier Actually Gets You

    Most students overestimate how much Claude Pro they need. The free tier handles:

    • Essay feedback and drafting assistance
    • Coding help — debugging, explaining concepts, generating boilerplate
    • Research summarization — paste an article or paper, get a structured summary
    • Math and problem-set walkthroughs
    • Study guide generation from lecture notes

    Where you’ll hit limits: long research sessions on a single topic, processing multiple long documents in the same conversation, or high-volume API access for a class project. For those cases, Claude Pro or API credits are the right call.

    Claude for Education — Current Status

    Anthropic’s education program is expanding but not yet universal. The fastest way to find out if your institution participates is to email your university’s IT department or check whether your library already has a Claude subscription that extends to students.

    Harvard, for example, replaced ChatGPT Edu with Claude in 2026 — so institutional deals are happening. If your school hasn’t moved yet, it may soon.

    What Claude Pro Is Actually Worth for Students

    If you’re doing intensive AI-assisted work — a thesis, a capstone project, a research paper that requires synthesizing many sources — $20/month is reasonable for a semester. Many students find they need it for two or three months out of the year and can drop to free for the rest.

    There’s no annual commitment required. You can subscribe month-to-month and cancel when the project is done.

    Bottom Line

    Start with the free tier. It covers the majority of student use cases. If you hit the limit consistently, check whether your university has an institutional deal before paying. If neither works for your project, Claude Pro at $20/month is month-to-month with no lock-in.

    For teams making a buying decision

    Evaluating Claude for a team — not just yourself?

    If you’re working through the plan decision for a business or agency, the calculus is different than individual use. We’ve run this math across 20+ client accounts and can tell you exactly where the API breaks even vs. subscription, and which plan structure makes sense for your workload.

    Get a plan recommendation →

  • Claude for Legal: How Law Firms Are Using AI to Cut Research Time, Draft Faster, and Bill Smarter

    Claude for Legal: How Law Firms Are Using AI to Cut Research Time, Draft Faster, and Bill Smarter

    Last refreshed: May 15, 2026

    Law firms have always been early adopters of tools that compress billable time. Document review software. Legal research databases. E-discovery platforms. The pattern is consistent: the firms that adopt early capture the margin advantage, and the rest catch up at cost.

    Claude is following that pattern. And the window where using it is a competitive advantage rather than table stakes is closing faster than most legal professionals realize.

    This is a practical guide to where Claude actually delivers in legal work — not theoretical use cases, but the specific tasks where it earns its keep — and where you still need a human in the loop.

    Where Claude Delivers the Most Value in Legal Practice

    Legal Research and Case Law Summarization

    The highest-leverage use case for most attorneys is research compression. Claude can take a 40-page appellate decision and return a structured summary — holding, reasoning, key facts, dissent — in under 60 seconds. It can synthesize across multiple cases to identify how a circuit has treated a specific doctrine over time.

    What it cannot do: verify citations autonomously or guarantee it has not hallucinated a case name. Every citation must be independently verified in Westlaw or Lexis before it goes into a brief. Claude is the first pass, not the final check.

    Practical workflow: paste the full text of the opinion (Claude’s 200K context window handles most decisions comfortably), ask for a structured summary with specific fields — holding, key facts, procedural posture, distinguishing factors — and use that as the basis for your own analysis rather than the analysis itself.

    Contract Drafting and Redlining

    Claude handles first-draft contract language well, particularly for standard commercial agreements where the structure is predictable: NDAs, MSAs, employment agreements, vendor contracts. Give it the deal terms and the governing law, and it produces a serviceable first draft that your attorney then marks up rather than writing from scratch.

    For redlining, paste the counterparty’s draft and ask Claude to identify provisions that deviate from market standard, flag missing protections, or summarize the risk profile of specific clauses. It catches things that get missed at 11pm on a deal close.

    The limitation: Claude does not know your client’s specific risk tolerance, industry norms for your particular market, or the negotiating history with this counterparty. Those judgment calls remain human work.

    Deposition and Discovery Preparation

    One of the most underused legal applications is using Claude to prepare for depositions. Feed it the deponent’s prior testimony, relevant documents, and the key issues in the case. Ask it to generate a question outline organized by theme, flag inconsistencies in prior statements, and identify documents to confront the witness with.

    It can also process large document productions and summarize by custodian, date range, or topic — substantially reducing the time a paralegal or junior associate spends on initial review.

    Client Communication and Memo Drafting

    Client-facing memos — explaining a legal issue in plain language, summarizing a court ruling’s implications, drafting a status update — are exactly the kind of writing where Claude performs well and where attorneys often underinvest time. The work is important but not intellectually complex. Claude produces a solid draft; the attorney reviews, adjusts for client relationship context, and sends.

    What Claude Cannot Do in Legal Work

    • It cannot verify citations. It will hallucinate case names and citations with confidence. Every citation must be checked against an authoritative legal database.
    • It cannot provide legal advice. It produces language and analysis, not professional judgment. The attorney exercises judgment; Claude compresses the work that precedes it.
    • It does not know current law. For recent statutory changes, new regulations, or fresh precedent, you need current research tools.
    • It lacks client context. Claude does not know your client’s history, risk appetite, or the relationship dynamics that shape legal strategy.
    • Confidentiality considerations apply. Before pasting client documents into any AI tool, your firm needs a clear policy on what data is permissible to process externally and under what terms.

    Getting Claude Set Up for Legal Work

    The most effective legal deployment of Claude is not the chat interface — it is Claude with a strong system prompt that establishes context, format expectations, and guardrails. A system prompt for a litigation practice might specify the governing jurisdiction, output format requirements, what it should flag for attorney review, and firm-specific terminology.

    For firms with technical capacity, Claude’s API allows integration directly into document management systems, allowing attorneys to invoke Claude without leaving the tools they already use.

    The Billing Question

    The elephant in the room for law firms considering AI adoption is the billing model. If Claude compresses a five-hour research task to one hour, do you bill five hours or one?

    The firms navigating this well are shifting toward value billing and fixed-fee arrangements where efficiency is profit rather than a billing problem. The ABA and state bars are actively developing guidance on AI use and disclosure. Following your jurisdiction’s bar guidance and staying current on disclosure requirements is non-negotiable.

    Bottom Line

    Claude does not replace legal judgment. It compresses the work that precedes judgment — research, drafting, review, summarization — at a quality level that makes it worth building into the workflow of any firm serious about efficiency. Pick one task category, run Claude against your next ten instances of that task, and measure the time delta. The ROI case makes itself.

  • Anthropic at Scale: 5 Gigawatts, $30B Revenue Run Rate, and What the Infrastructure Bet Means

    Anthropic at Scale: 5 Gigawatts, $30B Revenue Run Rate, and What the Infrastructure Bet Means

    Last refreshed: May 15, 2026

    Three data points published in the last two weeks of April 2026 define the scale at which Anthropic is now operating: a 5-gigawatt compute capacity commitment from Amazon announced April 20, a disclosed $30 billion annual revenue run rate (up from $9 billion at the end of 2025), and a customer base of more than 1,000 enterprises spending over $1 million per year. Taken together, they describe a company that has crossed the threshold from frontier AI lab to large-scale enterprise infrastructure provider.

    The Amazon Compute Commitment

    Five gigawatts of committed compute capacity is a number that requires context to land properly. For reference, a large data center campus typically consumes 100–500 megawatts. Five gigawatts is the equivalent of 10–50 large data center campuses worth of compute, committed to a single AI company. This is infrastructure at a scale that was historically reserved for hyperscalers building general-purpose cloud platforms — not AI model providers.

    The Amazon partnership is part of a broader compute story that also includes Google and Broadcom’s multi-gigawatt TPU partnership (announced April 6, with capacity launching in 2027). Anthropic is not building this infrastructure itself — it’s securing committed capacity from the two largest cloud providers simultaneously, which is a different and arguably more capital-efficient strategy than building proprietary data centers.

    Revenue: $9B to $30B in One Quarter

    The jump from $9 billion to $30 billion annualized run rate between end of 2025 and April 2026 is the most striking number in the disclosure. That’s not organic growth — that’s a step change that implies either a major enterprise contract cohort closing in Q1 2026, the Cowork and Claude Code adoption curves hitting inflection simultaneously, or both. The 1,000+ customers at $1 million+/year figure is consistent with enterprise adoption at scale: at $1 million average, 1,000 customers represents $1 billion in ARR from that cohort alone.

    For context on what $30 billion run rate means competitively: OpenAI disclosed approximately $3.7 billion in annualized revenue in mid-2024. If Anthropic’s figure is accurate and current, it suggests the competitive landscape has shifted more dramatically than most public coverage has reflected.

    What This Means for Enterprise Buyers

    Enterprise procurement teams evaluating AI vendors weigh financial stability heavily. A vendor that might not exist in 18 months is a vendor you don’t build critical workflows on. The combination of $30 billion run rate, 5 gigawatts of committed compute, and 1,000+ million-dollar customers removes the financial stability objection from the Anthropic procurement conversation in a way that a year ago it couldn’t.

    The Raj Narasimhan board appointment (April 14) is a governance signal in the same direction. Board composition at this revenue scale shapes how enterprise legal and compliance teams assess vendor risk. A mature board with enterprise-credible governance is a procurement unlock, not just a PR announcement.

    The Capacity Question

    The Google/Broadcom TPU capacity doesn’t launch until 2027. The Amazon commitment is a forward contract, not immediately available infrastructure. This means Anthropic is building compute capacity commitments ahead of demand — the right bet if the revenue trajectory continues, a costly overcommit if it doesn’t. The 2027 capacity launch timing will be worth watching against the actual demand curve that develops over the next 12 months.

    Source: Anthropic News