What is the Chrome WebMCP API and why does it matter for businesses?

The Chrome WebMCP API brings the Model Context Protocol natively into the browser, allowing web applications to expose structured tool interfaces that AI models can call directly. For businesses running web-based tools or WordPress sites, this means existing applications can become AI-accessible without a full backend rebuild.

What is the WordPress 7.0 Abilities API?

The WordPress 7.0 Abilities API provides agent-to-agent communication endpoints, allowing AI-powered systems to negotiate capabilities and permissions directly with a WordPress installation, transforming it into AI-native infrastructure.

What does AI passing the Turing test mean for content creators?

A UC San Diego study published in PNAS found that GPT-4.5 and Llama-3.1-405B both passed a three-party Turing test in 2026, with GPT-4.5 identified as human 73% of the time. For content creators, this permanently raises the quality bar — the only content that wins is content with elements AI cannot fake: lived experience, proprietary data, and first-person field reports.

What is Generative Engine Optimization (GEO)?

GEO is the practice of structuring web content so AI systems cite, reference, and recommend it. It involves entity enrichment, JSON-LD schema, authoritative citations, and machine-readable formatting targeting large language models rather than traditional search crawlers.

How should small businesses approach AI infrastructure in 2026?

Move from using AI as a productivity tool to building it into operational infrastructure. Start with structured data and schema markup, AI-optimized content pipelines, LLMS.txt crawlability, and agentic workflows where AI handles multi-step tasks autonomously.

What is a custom schema engine and why build one?

A custom schema engine is a purpose-built WordPress plugin generating structured data tailored to AI citation optimization. Unlike generic schema plugins, it outputs precisely the entity relationships and speakable content markers AI systems use when deciding which sources to cite.

What is the significance of AI reviewing AI-written code?

When Claude Fable 5 peer-reviewed code written by Claude Opus for a production plugin, it demonstrated a mature AI development workflow where different model tiers serve different roles — one for generation, another for QA — at machine speed and cost.

Can I use Claude Fable 5 with the Claude API?

Yes. Claude Fable 5 is available through the Anthropic API. It supports streaming, tool use, prompt caching, vision, the Batch API, and MCP server integration.

When should I use Fable 5 vs Sonnet for content production?

For most content production, Sonnet 4.6 is the right call at 3.3× less cost. Reach for Fable 5 when you need to synthesize across a very large corpus, when the content requires deep domain reasoning with extended thinking, or when the task involves both large-context ingestion and complex output generation in a single pass.

Is Claude Fable 5 available in Claude.ai?

Fable 5 is available in Claude.ai on the Pro and Team plans, selectable from the model picker. For production-scale usage, the API with Batch API discounts is more economical.

What is the best way to reduce Claude Fable 5 API costs?

Three levers: model routing (only use Fable 5 when needed), prompt caching (cache reads cost $1.00/M vs $10.00/M), and Batch API (50% discount on non-real-time workloads). Stacking all three can reduce effective costs by 85–95%.

Tag: AI Comparison

Azure Neural TTS vs Google Cloud Text-to-Speech: Audio Versions of Every Article

Adding an audio version of every article is one of those low-effort, high-leverage moves: it makes your content accessible to people who’d rather listen, it gives you a “play this article” widget that lifts time-on-page, and the audio file itself becomes another thing search and assistants can surface. The work is entirely automated — text goes in, an MP3 comes out — so the only real decisions are which voice sounds least like a robot and which free tier covers your back catalog.

We auto-generate audio versions of the same articles on both Azure Neural TTS and Google Cloud Text-to-Speech, on the free tiers, and listen. Short answer: this one’s an honest toss-up. Both produce genuinely natural neural voices, both give you SSML control, and both run our audio pipeline for $0/month. Azure’s free tier is 500,000 characters/month (~60–80 article audio versions of neural voices); Google’s is 1,000,000 characters/month of Standard voices and 1,000,000 characters/month of WaveNet/Neural2 premium voices. Pick by ecosystem and by which voice you’d rather hear.

This is the breakdown from the running lab on tygart.media — voice naturalness, SSML control, voice variety, free ceilings, and the accessibility/SEO payoff.

The free-tier ceilings

How we do it

	Azure	Google Cloud	Verdict
Free neural/premium chars/month	500,000 (Neural)	1,000,000 (WaveNet/Neural2)	Google — 2× headroom
Free standard chars/month	n/a (neural is the tier)	1,000,000 (Standard)	Google on raw volume
Roughly how many article audios	~60–80 neural/mo	~140 premium/mo	Google
Always-free	Yes	Yes	Tie
Our actual bill	$0	$0	Tie where it counts

A 1,200-word article runs around 6,500–7,000 characters, so Azure’s 500K neural budget covers roughly 60–80 full article audio versions a month, and Google’s 1M premium budget covers roughly twice that. For a publisher shipping a handful of articles a week, both stay free with room to spare — the 2× gap only bites if you’re voicing a large back catalog in one go.

Voice quality and SSML control

This is where you actually choose, and it’s genuinely close.

How we do it

	Azure	Google Cloud	Verdict
Voice naturalness	Excellent, very expressive	Excellent, very natural	Tie — both clear the “robot” bar
Voice variety	Huge neural catalog, many styles	Large WaveNet/Neural2 catalog	Slight edge Azure on styles
Speaking styles / emotion	Yes (cheerful, newscast, etc.)	More limited emotional styles	Azure
SSML control	Full SSML + style/prosody tags	Full SSML	Azure, slightly
Custom voice	Yes (custom neural voice)	Yes (custom voice)	Tie
Languages / locales	140+ locales	50+ languages, many voices	Azure on locale breadth

Both clear the bar that matters: neither sounds like a 2010-era text-to-speech engine, and a casual listener wouldn’t immediately clock either as synthetic. Azure edges ahead on expressiveness — its neural voices support named speaking styles (newscast, cheerful, empathetic) that are perfect for an article read-aloud, and its SSML supports fine prosody control. Google’s Neural2 voices are beautifully natural and, to some ears, a touch warmer; the emotional-style controls are just a little thinner.

The accessibility and SEO payoff

The audio isn’t only a nice-to-have. It does real work.

How we do it

	Azure	Google Cloud	Verdict
Accessibility win	Listen instead of read	Listen instead of read	Tie
Output format	MP3 / WAV / streaming	MP3 / LINEAR16 / OGG	Tie
Pipeline integration	REST + SDKs	REST + SDKs	Tie
Time-on-page lift	Audio widget keeps people on page	Same	Tie

An audio version gives screen-reader users and “I’d rather listen” users a first-class way to consume the piece, and the on-page player tends to lift dwell time — a signal that doesn’t hurt. The mechanics are identical on both clouds: feed text, get an MP3, embed it.

What surprised us

Both are genuinely good now. We expected one to clearly win on naturalness and neither did — the synthetic-voice era is over on both clouds.
Azure’s speaking styles are the sleeper feature. Being able to render an article in a “newscast” or “cheerful” style without writing prosody by hand made the read-alouds noticeably more engaging.
Google’s free character budget is the bigger one. 1M premium characters is real headroom; if you’re voicing a back catalog, that matters more than a half-point of naturalness.
The MP3s are interchangeable. Once embedded, listeners couldn’t reliably tell which cloud voiced which article in a blind test we ran on ourselves.

The takeaway

Pick Azure Neural TTS if you want maximum expressiveness — named speaking styles, fine prosody control, and the broadest locale catalog — and your Microsoft ecosystem is already where the rest of your stack lives. The 500K free characters cover a normal publishing cadence comfortably.

Pick Google Cloud Text-to-Speech if you want the larger free character budget (1M premium) for voicing a big back catalog, or you simply prefer the warmth of the Neural2 voices, and your stack is GCP-centric.

For us this is the rare comparison with no loser. We run the pipeline on whichever cloud the rest of that article’s workflow already lives on — and the listener can’t tell the difference either way.

This is part of our “Two Clouds, One Site” series — we run the same media property on both Azure and Google Cloud on the free tiers, generating audio versions of the same articles on each to hear where the voices differ. The lab lives on tygart.media; the findings publish here.

Frequently asked questions

How many free characters do Azure and Google text-to-speech give you per month?
Azure Neural TTS gives 500,000 free neural characters per month, which is roughly 60–80 article audio versions. Google Cloud Text-to-Speech gives 1,000,000 free Standard characters and 1,000,000 free WaveNet/Neural2 premium characters per month, roughly double Azure’s premium headroom. Both stay free for a normal publishing cadence.

Which text-to-speech sounds more natural, Azure or Google?
Both produce genuinely natural neural voices, and in blind listening neither clearly wins. Azure edges ahead on expressiveness with named speaking styles like newscast and cheerful, while Google’s Neural2 voices are very natural and, to some ears, slightly warmer. The synthetic-robot problem is solved on both.

Can I auto-generate an audio version of every blog post for free?
Yes. Both clouds expose a simple REST API that turns article text into an MP3, and their free character budgets cover a typical few-articles-a-week cadence at $0. Google’s larger free budget is better if you want to voice a big back catalog in one pass.

Does Azure Neural TTS support SSML and speaking styles?
Yes. Azure supports full SSML plus named speaking styles (newscast, cheerful, empathetic and more) and fine prosody control, which makes article read-alouds noticeably more engaging. Google also supports full SSML, but its emotional-style controls are thinner.

Does adding an audio version of articles help accessibility and SEO?
Yes. An audio version gives screen-reader and listen-first users a first-class way to consume the content, improving accessibility, and the on-page audio player tends to lift time-on-page, which is a positive engagement signal. The benefit is identical whether you generate the audio on Azure or Google.

July 3, 2026

Azure Translator vs Google Cloud Translation: 2M Free Characters, Tested

Translating your content is one of the cheapest ways to multiply its reach — every article becomes five articles the moment you ship it in five languages. The catch is that machine translation is metered by the character, and a content pipeline burns characters fast. So the real question for a bootstrapped publisher isn’t “which engine is best?” — it’s “which free tier lets me run a multilingual pipeline forever without ever seeing a bill?”

We translate the same articles into multilingual variants on both Azure Translator and Google Cloud Translation, on the free tiers, and watch where each one runs out. Short answer: for a perpetual $0 pipeline, Azure Translator wins on the ceiling — its free tier is 2,000,000 characters/month and it’s always free, which is roughly 300 article-length translations a month. Google Cloud Translation gives you a generous-but-capped 500,000 characters/month and then it’s paid, and it earns its keep on quality and language coverage.

This is the breakdown from the running lab on tygart.media — free ceilings, translation nuance, document vs text, and which one we actually point the pipeline at.

The free-tier ceilings

This is the headline difference, and it’s not close.

How we do it

	Azure	Google Cloud	Verdict
Free characters/month	2,000,000, always free	500,000, then paid	Azure — 4× the ceiling
Roughly how many articles	~300 article translations/mo	~75 article translations/mo	Azure
What happens at the cap	Pay-as-you-go kicks in	Pay-as-you-go kicks in	Tie (mechanism)
Always-free vs 12-month trial	Always free	Always free (the 500K is perpetual)	Tie
Fit for a perpetual pipeline	Excellent	Tight	Azure

The math is the whole story. A typical 1,200-word article is around 6,500–7,000 characters. Translate it into five languages and you’ve spent ~35,000 characters on one article. Azure’s 2M ceiling absorbs dozens of articles across multiple languages every month without a cent; Google’s 500K runs dry after a couple of weeks of the same cadence. If your single hard constraint is “never pay for translation,” Azure is the answer before you even look at quality.

Translation quality and nuance

Free ceilings decide whether you can run the pipeline. Quality decides whether you should publish what comes out.

How we do it

	Azure	Google Cloud	Verdict
Engine	Neural MT, custom models available	Neural MT (NMT), strong general model	Slight edge Google on nuance
Idiom / register handling	Good, occasionally literal	More natural on idioms and tone	Google
Technical terminology	Reliable, customizable glossary	Reliable	Tie
Custom/glossary control	Custom Translator + dictionary	Glossary + AutoML (paid)	Azure on free customization
Major-language quality	Excellent both ways	Excellent both ways	Tie

On high-resource languages — Spanish, French, German, Portuguese — both engines produce output we’d publish with a light editorial pass. Google has a slight edge on idiom and register: it tends to “sound like a person” a beat more often, especially on conversational copy. Azure closes most of that gap with Custom Translator and inline dictionaries, which let you pin brand terms and preferred phrasings — and those customization tools are usable inside the free workflow.

Language coverage and document mode

How we do it

	Azure	Google Cloud	Verdict
Languages supported	100+	100+ (NMT subset varies)	Tie
Long-tail / low-resource	Broad	Broad, often strong	Google, slightly
Document translation	Yes (preserves formatting)	Yes (separate API surface)	Tie
Text translation API	Simple REST	Simple REST	Tie
Batch throughput	High	High	Tie

Both clouds clear 100 languages, so coverage isn’t a deciding factor for a Western-market content site. Document translation — feeding in a formatted file and getting the same layout back in another language — exists on both; we mostly use plain text translation because our content is markdown and we re-render it ourselves.

What surprised us

The character ceiling, not the quality, is the real constraint. We went in expecting a quality shootout and came out realizing that for a content pipeline, “2M free vs 500K free” decides the workflow long before anyone compares a single sentence.
Azure’s always-free 2M is genuinely always free. It’s not a 12-month trial that lapses into charges — it resets every month indefinitely. That’s rare enough that we double-checked it.
Google’s output reads slightly more human on conversational copy. For marketing-voice pieces we noticed Google needed less editorial cleanup; for technical articles the two were indistinguishable.
Glossaries matter more than the base engine. Once you pin your brand and product terms, the gap between the two narrows to almost nothing.

The takeaway

Pick Azure Translator if your priority is a perpetual multilingual content pipeline that never bills you — the 2M-character always-free ceiling is built for exactly this, and Custom Translator gives you brand-term control for free. It’s our default for high-volume article translation.

Pick Google Cloud Translation if quality on conversational, idiom-heavy copy is your top concern and your volume fits comfortably under 500K characters/month — its NMT output tends to need a lighter editorial pass.

For us, running the same site on both clouds, the translation pipeline lives on Azure: at our cadence we’d blow through Google’s free tier in two weeks, and Azure’s ceiling means the multilingual variants ship at $0, month after month.

This is part of our “Two Clouds, One Site” series — we run the same media property on both Azure and Google Cloud on the free tiers, translating the same articles on each to see where the ceilings really sit. The lab lives on tygart.media; the findings publish here.

Frequently asked questions

How many free characters do Azure Translator and Google Cloud Translation give you per month?
Azure Translator’s free tier is 2,000,000 characters per month and it’s always free, resetting every month indefinitely. Google Cloud Translation’s free tier is 500,000 characters per month, after which you pay per character. For a content pipeline, Azure’s ceiling is roughly four times larger.

Which machine translation is more accurate, Azure or Google?
Both use neural machine translation and produce publish-quality output on major languages. Google has a slight edge on idiom, tone, and conversational register, while Azure closes most of that gap with its free Custom Translator and dictionary features. For technical content the two are hard to tell apart.

Can I run a multilingual website translation pipeline for free?
Yes. Azure Translator’s 2,000,000 free characters per month is enough for roughly 300 article-length translations, which covers a typical publishing cadence across several languages at $0. Google’s 500,000 free characters works for lower-volume sites but runs out faster at the same pace.

Does Azure Translator support document translation that keeps formatting?
Yes. Azure offers a document translation mode that preserves the original layout and formatting of files, alongside a simple text translation REST API. Google Cloud Translation offers document translation too. We mostly use plain text translation because our content is markdown that we re-render ourselves.

How many languages do Azure Translator and Google Cloud Translation support?
Both support more than 100 languages, so coverage is rarely the deciding factor for a Western-market site. Google sometimes edges ahead on lower-resource languages, but for common European and Latin American languages the two are equivalent in reach.

July 3, 2026

Claude Fable 5 Pricing and Access (2026)

Last verified: June 13, 2026

Claude Fable 5 (claude-fable-5) is Anthropic’s most capable widely released model, built for the most demanding reasoning and long-horizon agentic work. On the Claude API it is priced at $10 per million input tokens and $50 per million output tokens — double the rate of Claude Opus 4.8 — with a 1M-token context window and up to 128K output tokens per request. It reached general availability on June 9, 2026. The verified pricing and access details are below.

Pricing at a glance

All figures below are from Anthropic’s official pricing and models pages. Prices are in USD per million tokens (MTok). Fable 5 includes the full 1M-token context window at standard pricing — there is no long-context premium.

Item	Claude Fable 5
Model ID (API)	`claude-fable-5`
Base input	$10 / MTok
Output	$50 / MTok
5-minute cache write	$12.50 / MTok
1-hour cache write	$20 / MTok
Cache hit / read	$1 / MTok
Batch API input / output	$5 / MTok · $25 / MTok
Context window	1M tokens
Max output	128K tokens

How Fable 5 compares to Opus, Sonnet, and Haiku

Fable 5 sits at the top of Anthropic’s lineup, a tier above the Opus models. The per-token cost difference is the clearest way to see where it fits.

Model	Input $/MTok	Output $/MTok	Context	Max output
Claude Fable 5	$10	$50	1M	128K
Claude Opus 4.8	$5	$25	1M	128K
Claude Sonnet 4.6	$3	$15	1M	64K
Claude Haiku 4.5	$1	$5	200K	64K

Where you can use Fable 5

At general availability, Fable 5 is offered across Anthropic’s first-party API and all major cloud platforms, plus claude.ai subscription plans (subject to the access note below). The model IDs differ by platform.

Surface	Availability / model ID
Claude API (first-party)	Generally available — `claude-fable-5`
Claude Platform on AWS	Generally available — `claude-fable-5`
Amazon Bedrock	Generally available — `anthropic.claude-fable-5`
Google Vertex AI	Generally available — `claude-fable-5`
Microsoft Foundry	Generally available
claude.ai — Pro, Max, Team, Enterprise	Promotional access June 9–22, 2026 (see below)
claude.ai — Free plan	Not included

Consumer-plan access and the promotional window

For claude.ai subscribers, Anthropic launched Fable 5 with a time-limited promotion rather than a permanent plan inclusion. From June 9 through June 22, 2026, Fable 5 was included on the Pro, Max, Team, and seat-based Enterprise plans at no extra charge. During that window, Anthropic’s documentation states that Fable 5 usage “counts toward your plan’s usage limits, and you won’t be charged anything extra,” but that it draws from those limits “at a higher rate than other models.” The Free plan was explicitly excluded.

Anthropic’s announced plan was that after June 22, 2026, Fable 5 would no longer be included in plan usage limits, and continued use on claude.ai would require usage credits — a pay-as-you-go balance for usage beyond what a plan includes.

Integration notes that affect cost and handling

Fable 5 differs from the Opus, Sonnet, and Haiku models in a few ways that matter when you wire it into an application. It ships with safety classifiers that can decline a request: when that happens, the Messages API returns stop_reason: "refusal" as a successful HTTP 200 response, not an error. You are not billed for a request that is refused before any output is generated, and Anthropic provides server-side, client-side, and manual fallback paths to retry on another Claude model. Adaptive thinking is always on (thinking: {"type": "disabled"} is not supported), and the raw chain of thought is never returned — thinking.display controls whether thinking blocks contain a summary or are empty. Fable 5 also uses the tokenizer introduced with Opus 4.7, which can produce roughly 30–35% more tokens for the same text than older models, so re-baseline your token counts rather than assuming parity with earlier Claude models.

How much does Claude Fable 5 cost?

On the Claude API, Fable 5 costs $10 per million input tokens and $50 per million output tokens. Prompt-cache writes are $12.50/MTok (5-minute) or $20/MTok (1-hour), cache reads are $1/MTok, and the Batch API halves the rate to $5/MTok input and $25/MTok output.

Is Fable 5 more expensive than Claude Opus 4.8?

Yes. Fable 5 is priced at exactly double Opus 4.8 on both input ($10 vs $5 per MTok) and output ($50 vs $25 per MTok). Both share a 1M-token context window and 128K max output.

Which claude.ai plans include Fable 5?

From June 9 to June 22, 2026, Fable 5 was included on the Pro, Max, Team, and seat-based Enterprise plans at no extra cost, drawing from plan usage limits at a higher rate. The Free plan was not included. Anthropic’s plan was to move continued claude.ai use to usage credits after June 22.

What is the difference between Fable 5 and Mythos 5?

They share the same specs ($10/$50 per MTok, 1M context, 128K output) and June 9, 2026 launch date. Fable 5 is the generally available model with built-in safety classifiers that can decline requests; Mythos 5 is offered only in limited availability.

June 13, 2026

Claude Cowork vs Code vs Agent SDK vs Managed Agents (2026)

Last verified: June 13, 2026

Anthropic ships four distinct ways to put Claude to work as an agent, and they are easy to confuse. The short version: Claude Cowork and Claude Code are interactive products billed through your Claude subscription â€” Cowork for knowledge work in the desktop app, Code for software work in your terminal, IDE, desktop, or browser. The Claude Agent SDK and Managed Agents are programmatic surfaces for developers, billed through the API: the Agent SDK is a Python/TypeScript library that runs the agent loop inside your own process, while Managed Agents is a REST API where Anthropic runs the loop and hosts the sandbox. The tables below give the verified, side-by-side breakdown.

The decision matrix

Each row is one surface. Read across for who it serves, whether you drive it turn-by-turn or hand it a goal, where the work executes, and how it is paid for.

Surface	Who it is for	Interactive vs autonomous	Where it runs	How it is billed
Claude Cowork	Knowledge workers (non-developers) â€” research, documents, file and spreadsheet work	Interactive, supervised â€” shows you the plan and waits for your approval before acting	The Claude desktop app on your own computer (macOS or Windows); not available on web or mobile	Claude subscription (Pro, Max, Team, Enterprise) â€” draws from your plan’s usage allocation
Claude Code	Developers doing interactive coding â€” build features, fix bugs, automate dev tasks	Interactive â€” you drive it in a session, though it can run agentically across files and tools	Your machine (terminal, VS Code, JetBrains, desktop app) or the browser at claude.ai/code	Claude subscription or an Anthropic Console (API) account
Claude Agent SDK	Developers building custom agents programmatically (Python or TypeScript)	Autonomous â€” Claude reads files, runs commands, and edits code on its own via the agent loop	Your own process and infrastructure	API key (pay-as-you-go credits); see the subscription note below for the June 15, 2026 change
Managed Agents	Developers running production or long-running agents without operating their own sandbox/session infrastructure	Autonomous â€” you send events, Claude executes tools and streams back results	Anthropic-managed cloud sandbox per session (or a self-hosted sandbox on your own infrastructure)	Claude API key + the `managed-agents-2026-04-01` beta header (no subscription path)

Where billing actually differs

The cleanest way to split these four is by the wallet they draw from. The two interactive products are funded by a subscription; the two programmatic surfaces are funded by the API. This is the single distinction that trips people up most often, so it is worth stating plainly in its own table.

Surface	Billing model	Notes
Claude Cowork	Subscription	Included on Pro, Max, Team, and Enterprise. Multi-step tasks consume more of your usage allocation than chatting.
Claude Code	Subscription or API	Most surfaces require a Claude subscription or a Console account; the terminal CLI and VS Code also support third-party providers.
Claude Agent SDK	API (pay-as-you-go)	Authenticated with an `ANTHROPIC_API_KEY`; also supports Bedrock, Claude Platform on AWS, Vertex AI, and Azure. Anthropic does not permit claude.ai login for third-party agents built on the SDK.
Managed Agents	API (credits)	Requires a Claude API key and the beta header; enabled by default for API accounts.

One dated nuance is worth pinning down because it changes how subscription users pay for programmatic work. Starting June 15, 2026, Claude Agent SDK and claude -p usage on subscription plans no longer counts toward your Claude plan’s interactive usage limits; instead, eligible subscribers receive a separate monthly Agent SDK credit (per-user, not pooled), while subscription usage limits stay reserved for interactive use of Claude Code, Cowork, and Claude. If you use the Agent SDK with an API key from the Claude Platform, nothing changes â€” pay-as-you-go billing continues and you do not receive an Agent SDK monthly credit.

SDK vs Managed Agents: the programmatic split

Both programmatic surfaces let Claude run tools autonomously, but they differ in where the loop and the work live. Anthropic’s own comparison frames it this way: the Agent SDK “is a library that runs the agent loop inside your own process,” while Managed Agents “is a hosted REST API: Anthropic runs the agent and the sandbox, and your application sends events and streams back results.” Pick by who you want operating the infrastructure.

Dimension	Agent SDK	Managed Agents
Runs in	Your process, your infrastructure	Anthropic-managed infrastructure
Interface	Python or TypeScript library	REST API
Agent works on	Files on your infrastructure	A managed sandbox per session
Session state	JSONL on your filesystem	Anthropic-hosted event log
Best for	Local prototyping; agents that work directly on your filesystem and services	Production agents without operating sandbox/session infrastructure; long-running, asynchronous sessions

A common path, per Anthropic’s docs, is to prototype with the Agent SDK locally, then move to Managed Agents for production.

Quick chooser

If you are not writing code and want Claude to finish a task on your computer, use Cowork. If you are a developer working interactively on a codebase, use Claude Code. If you are building your own agent and want it to run in your own process, use the Agent SDK. If you want Anthropic to run the agent and host the sandbox for long-running or production work, use Managed Agents.

Is Claude Cowork the same as Claude Code?

No. Both appear in the Claude desktop app, but Cowork is aimed at knowledge work (research, documents, spreadsheets, file management) for non-developers, while Claude Code is an agentic coding tool. Cowork runs only in the desktop app (macOS or Windows); Claude Code also runs in the terminal, VS Code, JetBrains, and the browser.

Does a Claude subscription cover the Agent SDK or Managed Agents?

Cowork and Claude Code are included with Claude subscriptions (Pro, Max, Team, Enterprise). The Agent SDK and Managed Agents are API surfaces authenticated with a Claude API key. As of June 15, 2026, subscription users do get a separate monthly Agent SDK credit for SDK and claude -p usage, but Managed Agents has no subscription path â€” it requires an API key and a beta header.

Where does the work actually execute for each surface?

Cowork runs on your own computer in the desktop app. Claude Code runs on your machine (or in the browser). The Agent SDK runs in your own process and infrastructure. Managed Agents executes in an Anthropic-managed cloud sandbox per session, or a self-hosted sandbox you control.

Is the Agent SDK built on Claude Code?

Yes. Per Anthropic, the Agent SDK “gives you the same tools, agent loop, and context management that power Claude Code, programmable in Python and TypeScript.” Anthropic also describes it as “Claude Code as a library.”

Is Managed Agents generally available?

No. As of June 13, 2026, Claude Managed Agents is in beta. Every Managed Agents endpoint requires the managed-agents-2026-04-01 beta header (the SDK sets it automatically), and access is enabled by default for API accounts.

June 13, 2026

Claude vs GPT vs Gemini: Coding Benchmark Leaderboard (June 2026)

Last verified: June 13, 2026

As of June 13, 2026, the four models most often compared for coding work are Claude Fable 5 and Claude Opus 4.8 from Anthropic, GPT-5.5 from OpenAI, and Gemini 3.1 Pro from Google. This page is a leaderboard built on one rule: every score below is taken from a vendor’s own page or the benchmark’s official model card that we fetched on the verification date, or it is marked as not published. Several vendors publish their benchmark tables as images rather than machine-readable text; where we could not read an official figure directly, we list the metric as not machine-verifiable and link to the source document instead of estimating. The result is a smaller table than most roundups, but every number in it is one you can click through and check.

Models and pricing (verified specs)

These columns are confirmed from each vendor’s official model documentation. Claude prices, context windows, and cutoffs come from Anthropic’s models overview and the AWS Bedrock model card; GPT-5.5 from OpenAI’s developer docs; Gemini 3.1 Pro from Google’s DeepMind model card and the Gemini API pricing page.

Model	API ID	Input / Output (per Mtok)	Context	Max output	Knowledge cutoff
Claude Fable 5	`claude-fable-5`	$10 / $50	1M	128K	Not stated on overview*
Claude Opus 4.8	`claude-opus-4-8`	$5 / $25	1M	128K	Jan 2026
GPT-5.5	`gpt-5.5`	$5 / $30	1,050,000	128K	Dec 1, 2025
Gemini 3.1 Pro	`gemini-3.1-pro-preview`	$2 / $12 (â‰¤200K)**	1M	64K	Not stated on model card

*Anthropic’s models overview lists Fable 5’s specs and price but does not publish a knowledge-cutoff date for it in the table we fetched. **Gemini 3.1 Pro uses tiered pricing: $2 / $12 per Mtok for prompts up to 200K tokens, rising to $4 / $18 for prompts above 200K tokens (Google AI pricing page). GPT-5.5 pricing rises to 2x input / 1.5x output above 272K input tokens (OpenAI developer docs). Claude Opus 4.8 offers an optional fast mode at $10 / $50 per Mtok (Anthropic).

Coding benchmark scores (primary-source only)

Each cell is either a figure we read directly from a primary source on June 13, 2026, or marked “not machine-verifiable” with the source you should consult. A blank-equivalent entry never means zero â€” it means the official figure was not available in readable form during verification. Note the harness and version differences called out in the footnotes: they make cross-vendor cells not strictly comparable.

Benchmark	Claude Fable 5	Claude Opus 4.8	GPT-5.5	Gemini 3.1 Pro
SWE-bench Verified	Not machine-verifiable (see system card)	Not machine-verifiable (see system card)	Not published in retrievable primary source	80.6%
SWE-bench Pro (Public)	Not machine-verifiable (see system card)	Not machine-verifiable (see system card)	Not published in retrievable primary source	54.2%
Terminal-Bench	Not machine-verifiable (see system card)	Not machine-verifiable (see system card)	83.4% (v2.1, Codex CLI harness)â€	68.5% (v2.0, Terminus-2 harness)
LiveCodeBench Pro	Not published in retrievable primary source	Not published in retrievable primary source	Not published in retrievable primary source	2887 Elo

â€ GPT-5.5’s Terminal-Bench 2.1 figure of 83.4% is the score Anthropic attributes to GPT-5.5 “with the Codex CLI harness” in a footnote on its Claude Opus 4.8 announcement page. It is a competitor-reported comparison, not a number we read from OpenAI directly. Google reports Gemini 3.1 Pro on Terminal-Bench 2.0 under the Terminus-2 harness (68.5%); because the version and harness differ, the Gemini and GPT-5.5 Terminal-Bench cells are not directly comparable. Gemini’s SWE-bench Verified (80.6%), SWE-bench Pro Public (54.2%), and LiveCodeBench Pro (2887 Elo) are single-attempt figures from Google’s official Gemini 3.1 Pro model card.

What we could not verify from a primary source

Anthropic publishes its coding comparison tables for Claude Opus 4.8 and Claude Fable 5 as images inside its announcement pages, and the full Claude Opus 4.8 System Card PDF exceeded our fetch size limit, so we could not machine-read those percentages on the verification date. OpenAI’s GPT-5.5 announcement page returned an access error to our fetcher, and its developer-docs model page lists specs and pricing but no benchmark scores. We have therefore left Claude’s and GPT-5.5’s SWE-bench figures out of the table rather than reproduce numbers we could not confirm at the source. For those figures, consult the primary documents linked in our source list: the Claude Opus 4.8 System Card, the Claude Fable 5 and Mythos 5 announcement, and OpenAI’s GPT-5.5 page. If you are choosing a model today, the verified spec table above (price, context, output, cutoff) is the part you can rely on without caveat.

How to read a coding leaderboard

Three cautions apply to any 2026 coding comparison. First, harness matters: the same model scores differently on Terminal-Bench depending on whether it runs under Terminus-2, a Codex CLI scaffold, or a vendor’s internal agent, which is why we annotate every Terminal-Bench cell. Second, version matters: “Terminal-Bench 2.0” and “Terminal-Bench 2.1” are different test sets, and “SWE-bench Pro” public and full splits differ â€” a single percentage with no version is close to meaningless. Third, a headline score is one slice of behavior; long-horizon agentic coding, tool-call reliability, and context handling over a long session often decide real-world usefulness more than a single pass rate. Treat the verified cells here as a starting point, then test the shortlist on your own repository.

Which model has the highest published coding benchmark score in June 2026?

We cannot crown a single winner from primary sources alone, because Anthropic and OpenAI publish their coding scores in formats we could not machine-verify on June 13, 2026. From figures we could read directly, Google’s Gemini 3.1 Pro model card reports 80.6% on SWE-bench Verified and 54.2% on SWE-bench Pro (Public). Anthropic’s and OpenAI’s comparable figures are in their system cards and announcement pages, which we link in the sources; we did not reproduce them here because they were not readable at the source during verification.

What does Claude Fable 5 cost, and how is it different from Opus 4.8?

Claude Fable 5 (claude-fable-5) is priced at $10 per million input tokens and $50 per million output tokens, with a 1M-token context window and up to 128K output tokens (Anthropic models overview). Claude Opus 4.8 (claude-opus-4-8) is the Opus-tier flagship at $5 / $25 per Mtok, also 1M context and 128K output, with a January 2026 knowledge cutoff. Fable 5 is Anthropic’s most capable widely released model; Opus 4.8 is the lower-priced model most teams will use for everyday agentic coding.

Why are some benchmark cells marked “not machine-verifiable” instead of showing a number?

Because this page only prints scores we could confirm from a primary source on the verification date. Several vendors render their benchmark tables as images, and one large system-card PDF exceeded our fetch limit, so the underlying percentages were not readable to us. Rather than copy figures from third-party trackers, we mark the cell and point you to the official document. It keeps the leaderboard honest at the cost of being shorter.

How do the context windows compare?

Claude Fable 5, Claude Opus 4.8, and Gemini 3.1 Pro each offer a 1M-token context window; GPT-5.5 offers 1,050,000 tokens. Maximum output is 128K tokens for Claude Fable 5, Claude Opus 4.8, and GPT-5.5, and 64K tokens for Gemini 3.1 Pro. Note that Claude Opus 4.8’s context window is 200K on Microsoft Foundry specifically, per Anthropic’s documentation.

Is Terminal-Bench comparable across these models?

Not cell-for-cell. Google reports Gemini 3.1 Pro on Terminal-Bench 2.0 under the Terminus-2 harness (68.5%), while the GPT-5.5 figure we show (83.4%) is Terminal-Bench 2.1 under a Codex CLI harness, as attributed by Anthropic. Different versions and different harnesses mean the two numbers should not be read as a head-to-head result.

June 13, 2026

Latest Claude Models — June 2026 (Current Lineup, Pricing, and Specs)

Updated June 12, 2026. Fable 5 is the current top-tier model, released June 9, 2026. The full lineup: Fable 5 → Opus 4.8 → Sonnet 5 → Haiku 4.5. Pricing and availability verified against Anthropic’s official docs.

Current Claude Models — Quick Reference

Model	API ID	Input $/MTok	Output $/MTok	Context	Best For
Claude Fable 5 🆕	`claude-fable-5`	$10.00	$50.00	1M tokens	Complex engineering, long-horizon agentic work
Claude Opus 4.8	`claude-opus-4-8`	$5.00	$25.00	1M tokens	Everyday advanced work, high-volume pipelines
Claude Sonnet 5	`claude-sonnet-5`	$3.00	$15.00	1M tokens	Balanced capability and speed
Claude Haiku 4.5	`claude-haiku-4-5-20251001`	$1.00	$5.00	200K tokens	High-speed, cost-sensitive tasks

All four models support vision, tool use/function calling, and batch processing. Fable 5 and Opus 4.8 are available on AWS Bedrock, Google Cloud Vertex AI, and Microsoft Azure AI Foundry in addition to the direct Anthropic API.

Claude Fable 5 (June 9, 2026)

Fable 5 is Anthropic’s first publicly available Mythos-class model — a capability tier previously restricted to research and select enterprise partners. It’s the most capable model Anthropic has released to date.

What makes Fable 5 different:

SWE-bench Verified: 95.0% (vs 88.6% for Opus 4.8)
SWE-bench Pro: 80.0% (vs 69.2%)
Senior Engineer benchmark: 91/100 (vs ~63/100)
Adaptive extended thinking (always on, not a mode switch)

Important limitations:

2x the cost of Opus 4.8 ($10/$50 vs $5/$25)
Mandatory 30-day data retention — not available under zero data retention (ZDR)
Safety classifiers route cybersecurity, biology, chemistry, and distillation prompts to an Opus 4.8 fallback — you pay Fable 5 rates for Opus 4.8 output in those domains
Higher latency on complex tasks (60 seconds to several minutes vs 3–15 seconds for Opus 4.8)

Free through June 22, 2026: Claude Pro, Max 5x, Max 20x, Team, and Enterprise subscription plans include Fable 5 at no extra charge during the launch window.

Claude Opus 4.8

Opus 4.8 is Anthropic’s current workhorse for serious work — the right default for most API applications and Claude Code use. It supports zero data retention (ZDR), which Fable 5 does not.

Key specs:

Context: 1M tokens
Max output: 32K tokens per request
Extended thinking: Available (opt-in mode)
ZDR: Yes
Batch API: Yes (50% discount on batch processing)

Use Opus 4.8 as your default model unless you have a specific reason to go up to Fable 5 or down to Sonnet/Haiku. It hits the best balance of capability, speed, cost, and data policy flexibility.

Claude Sonnet 5

Sonnet 5 targets use cases where response speed matters and you don’t need Opus-level reasoning. It’s the model Anthropic’s own infrastructure runs Claude.ai on for Pro subscribers doing day-to-day chat work.

Key specs:

Context: 1M tokens
Max output: 64K tokens per request
Extended thinking: Available
ZDR: Yes
Typical latency: 1–5 seconds for most tasks

Good for: content generation pipelines, customer-facing chat, document analysis at volume, anything where sub-5-second response time matters.

Claude Haiku 4.5

Haiku 4.5 is Anthropic’s fastest and cheapest model. At $1 input / $5 output per million tokens, it’s 10x cheaper than Fable 5.

Key specs:

Context: 200K tokens (smaller than the Opus/Sonnet/Fable 1M window)
Max output: 16K tokens per request
Latency: Sub-second for most tasks
ZDR: Yes

The 200K context window is the main limitation. For tasks that fit within that window — classification, short-form generation, routing, extraction — Haiku 4.5 is the cost-optimal choice. For longer documents or conversations, step up to Sonnet or Opus.

How to Choose the Right Claude Model

The decision framework I use:

Does the task require multi-step reasoning, complex coding, or long-horizon autonomy? → Fable 5 (if cost and latency are acceptable) or Opus 4.8
Is this a routine task at reasonable volume? → Opus 4.8 as the default
Does latency matter more than maximum reasoning depth? → Sonnet 5
Is this high-volume, short-context, cost-sensitive work? → Haiku 4.5
Does your use case require zero data retention? → Any model except Fable 5

Most production applications use a routing strategy: Fable 5 or Opus 4.8 for the hard jobs, Haiku 4.5 for classification and pre-processing, Sonnet 5 for user-facing response generation.

Claude Subscription Plans and Model Access (June 2026)

Plan	Price	Models Included
Free	$0	Limited Sonnet 5 access
Pro	$20/mo ($17 annual)	Sonnet 5, Opus 4.8, Fable 5 (through June 22)
Max 5x	$100/mo	All models, 5x usage vs Pro
Max 20x	$200/mo	All models, 20x usage vs Pro
Team Standard	$20/seat/mo (annual), $25 month-to-month	All models + admin features
Team Premium	$100/seat/mo (annual), $125 month-to-month	All models + priority + advanced admin
Enterprise	$20/seat + usage at API rates (contact sales)	All models + ZDR + custom retention + SSO

Claude Code (the CLI tool) is included in all paid subscription plans. API access for building your own applications is separate — billed per token via the Anthropic Console regardless of subscription status.

Legacy Models (Still Available, No Longer Latest)

These models are still available via the API but are not Anthropic’s current recommended versions:

Claude Opus 4.7 (claude-opus-4-7) — prior Opus tier, succeeded by 4.8
Claude Opus 4.6 (claude-opus-4-6) — two generations back
Claude Sonnet 4.5 (claude-sonnet-4-5) — prior Sonnet tier
Claude 3.5 Haiku / Sonnet / Opus — Claude 3.x generation, still functional for legacy integrations

If you’re building a new application, start with the current lineup. Legacy model IDs are useful for maintaining compatibility in existing applications that haven’t been updated.

Platform Availability

Platform	Fable 5	Opus 4.8	Sonnet 5	Haiku 4.5
Anthropic API (direct)	✓	✓	✓	✓
AWS Bedrock	✓	✓	✓	✓
Google Cloud Vertex AI	✓	✓	✓	✓
Microsoft Azure AI Foundry	✓	✓	✓	✓
GitHub Copilot	✓ (via Foundry)	✓	✓	—

Frequently Asked Questions

What is the newest Claude model?
As of June 2026, Claude Fable 5 is the newest and most capable model Anthropic has released. It launched June 9, 2026. The API model ID is claude-fable-5.

Is Claude Fable 5 the same as Claude 5?
No. Anthropic changed the naming convention — there is no “Claude 5.” The Fable series is a new tier above the Opus/Sonnet/Haiku hierarchy. Fable 5 is Anthropic’s first Mythos-class model released for general availability.

What is the most powerful Claude model?
Claude Fable 5 is currently the most powerful. For tasks where Fable 5’s safety classifier routing applies (cybersecurity, biology, chemistry, distillation), or where zero data retention is required, Claude Opus 4.8 is the appropriate top-tier choice.

What Claude model does Claude.ai use by default?
Depends on your plan. Free tier uses a limited version of Sonnet. Pro and Max subscribers access Opus 4.8 as the default with Fable 5 available (through June 22, 2026 included, after that plan-dependent). Claude.ai routes to the appropriate model for your plan automatically.

How do I use the latest Claude model in the API?
Set the model parameter in your API request to the model ID. For Fable 5: "model": "claude-fable-5". For Opus 4.8: "model": "claude-opus-4-8". See the full API reference at console.anthropic.com.

What’s the difference between Claude Opus 4.8 and Fable 5?
Fable 5 is significantly stronger on complex engineering tasks — SWE-bench Pro: 80% vs 69.2%, and Senior Engineer benchmark: 91 vs ~63 out of 100. The trade-off: Fable 5 costs 2x more ($10/$50 vs $5/$25 per MTok), has higher latency, and requires 30-day data retention. For most work, Opus 4.8 is the right choice. Full Fable 5 vs Opus 4.8 breakdown here.

Changelog

June 12, 2026 — Added Claude Fable 5 (released June 9). Updated pricing table. Added platform availability table.
May 2026 — Claude Opus 4.8 and Sonnet 4.6 were the current top-tier models.

This page is updated as Anthropic releases new models. Last verified: June 12, 2026. For API pricing, check console.anthropic.com — the canonical source.

June 12, 2026

Claude Fable 5: Capabilities, Pricing ($10/$50), and When to Use It Over Opus 4.8

Anthropic released Claude Fable 5 on June 9, 2026 — and it’s the most capable model the company has ever made publicly available. After tracking every Claude release since the original 100K context window dropped, I can say this one is different. Fable 5 isn’t just an incremental update. It’s Anthropic’s Mythos-class model — the one they’d been keeping restricted — now opened up to anyone with an API key or a Claude subscription.

Here’s what you need to know: the pricing, the benchmarks, and the specific decision framework for when to use Fable 5 versus sticking with Opus 4.8.

Quick answer: Fable 5 costs $10/$50 per million input/output tokens (2x the cost of Opus 4.8). It outperforms Opus 4.8 significantly on complex coding, long-horizon tasks, and scientific research. Use Fable 5 when quality on hard problems justifies the cost. Use Opus 4.8 for high-volume, well-scoped, routine work.

What Is Claude Fable 5?

Claude Fable 5 (claude-fable-5) is Anthropic’s first publicly available Mythos-class model. The Mythos line is Anthropic’s highest capability tier — models that were previously restricted to research and select enterprise partners because of their raw power. Fable 5 is the version Anthropic deemed safe enough to release broadly.

The name shift (from the Opus/Sonnet/Haiku tier naming) signals something intentional. Fable 5 sits above the Opus line entirely. It’s a new ceiling.

Key specs:

Context window: 1M tokens (same as Opus 4.8)
Max output: 128K tokens per request
Thinking: Adaptive (always on — not a separate “thinking mode”)
Vision: Yes
Tool use / function calling: Yes
Available: Claude API, AWS Bedrock, Vertex AI, Microsoft Foundry

Claude Fable 5 Pricing

Model	Input (per MTok)	Output (per MTok)	Context
Claude Fable 5	$10.00	$50.00	1M tokens
Claude Opus 4.8	$5.00	$25.00	1M tokens
Claude Sonnet 4.6	$3.00	$15.00	1M tokens
Claude Haiku 4.5	$1.00	$5.00	200K tokens

Fable 5 costs exactly 2x Opus 4.8 on API. On subscription plans (Pro, Max, Team, Enterprise seat-based), Fable 5 is included at no extra cost through June 22, 2026.

The free-until-June-22 window matters if you’re evaluating whether to route your workloads to Fable 5. Use that window to benchmark it against your actual tasks before the 2x cost kicks in.

Benchmark Performance: Where Fable 5 Pulls Away

The benchmarks that matter most are the ones that measure what the model can do on real engineering work, not trivia:

Benchmark	Claude Fable 5	Claude Opus 4.8	Delta
SWE-bench Verified	95.0%	88.6%	+6.4 pts
SWE-bench Pro	80.0%	69.2%	+10.8 pts
FrontierCode	29.3%	13.4%	~2.2x
Senior Engineer benchmark	91/100	~63/100	+45% absolute

The Senior Engineer benchmark is the one I find most telling. It’s designed to be hard for people who write code for a living — and Fable 5 scores 45 percentage points higher than Opus 4.8. That gap is significant enough that it changes the calculus for serious engineering work.

When to Use Claude Fable 5 (vs Opus 4.8)

I’ve been routing tasks between models for long enough to have a framework. Here’s how I think about it:

Use Fable 5 when:

You’re running a large migration, refactor, or multi-stage software project
Quality on a hard problem matters more than per-token cost
You’re doing deep research, complex analysis, or long-horizon agentic work
The task would otherwise take a senior engineer half a day or more
You’re in the free evaluation window (through June 22) and want to benchmark

Use Opus 4.8 when:

The task is well-scoped and routine
You’re running high-volume pipelines where 2x cost compounds fast
Latency matters — Fable 5 can take 60 seconds to several minutes on complex tasks vs 3–15 seconds for Opus 4.8
The task falls in Fable 5’s restricted domains (cybersecurity, biology, chemistry, distillation) — in those categories, Fable 5 routes to Opus 4.8 anyway, so you’d pay Fable 5 prices for Opus 4.8 output

The smart routing strategy: Fable 5 for the hard jobs, Opus 4.8 for the rest. Don’t use Fable 5 as your default model — the cost and latency delta aren’t worth it for routine tasks.

Important Limitations to Know Before You Switch

Two limitations that don’t get enough coverage:

1. Safety classifier routing. Fable 5 includes enhanced safety classifiers. For prompts touching cybersecurity, biology, chemistry, and distillation, those classifiers route the request to a Claude Opus 4.8 fallback. You pay Fable 5 API rates ($10/$50) but get Opus 4.8 output. If your use case is in these domains, Fable 5 is not the upgrade it appears to be.

2. Data retention requirement. Fable 5 carries a mandatory 30-day data retention policy — Anthropic needs retained prompts and outputs to operate the safety classifiers. Claude Opus 4.8 is available under zero data retention (ZDR). If your use case requires ZDR (healthcare, legal, finance with strict data handling), stick with Opus 4.8 until Anthropic updates Fable 5’s data policy.

Availability

Claude Fable 5 is generally available as of June 9, 2026 on:

Claude API (claude-fable-5)
Claude Platform on AWS / Amazon Bedrock
Google Cloud Vertex AI
Microsoft Azure AI Foundry / GitHub Copilot

Subscription access (free through June 22, 2026): Claude Pro ($20/mo), Max 5x ($100/mo), Max 20x ($200/mo), Team, and seat-based Enterprise plans all include Fable 5 access at no extra charge during the launch window. After June 22, the plan-tier access picture may change — check Anthropic’s pricing page for updates.

How This Changes the Claude Model Decision Tree

Before Fable 5, the Claude decision tree was straightforward:

Need the best? → Opus 4.8
Need balance? → Sonnet 4.6
Need speed/cost? → Haiku 4.5

Now it’s:

Hard problems, complex projects, long-horizon work → Fable 5
Everyday work, high-volume pipelines → Opus 4.8
Balance of cost and capability → Sonnet 4.6
Speed and cost optimization → Haiku 4.5

The introduction of a model tier above Opus 4.8 doesn’t replace the existing lineup — it creates a new ceiling for the work that genuinely needs it.

Frequently Asked Questions

Is Claude Fable 5 better than Opus 4.8?
For complex coding, multi-stage tasks, and long-horizon work: yes, significantly. On SWE-bench Pro, Fable 5 scores 80.0% vs Opus 4.8’s 69.2% — a 10+ point gap. For routine, well-scoped tasks: the gap narrows enough that Opus 4.8’s 2x cost advantage makes it the smarter choice.

What is the Claude Fable 5 API model ID?
claude-fable-5. This is the API string you pass to model in your API calls.

Does Fable 5 cost more than Opus 4.8?
Yes — exactly 2x. Fable 5 is $10 input / $50 output per million tokens. Opus 4.8 is $5/$25. Through June 22, 2026, Fable 5 is included in Claude subscription plans at no extra cost.

Can I use Claude Fable 5 for free?
On Pro, Max, Team, and Enterprise subscription plans, yes — through June 22, 2026. API access is metered at $10/$50 per MTok from day one.

Does Claude Fable 5 support zero data retention (ZDR)?
No. Fable 5 carries a mandatory 30-day data retention requirement. If your use case requires ZDR, use Claude Opus 4.8, which supports it.

What’s the difference between Claude Fable 5 and Claude Mythos 5?
Mythos 5 is Anthropic’s fully restricted research model — not publicly available. Fable 5 is the Mythos-class model that Anthropic has prepared for general availability, with safety classifiers and the 30-day retention policy. You can think of Fable 5 as “Mythos for the real world.”

Last verified: June 12, 2026. Anthropic pricing and availability subject to change — check Anthropic’s pricing page for current rates.

June 12, 2026

The Signal: AI Just Split Into Two Lanes — Field Notes From June 10, 2026

The Signal is a daily AI intelligence briefing from Tygart Media — field notes from someone who builds with these tools 12 hours a day, not someone who reads press releases about them. Each edition distills the day’s most consequential AI and search developments into what they actually mean for agencies, small business operators, and builders shipping real infrastructure.

June 10, 2026: The Day the Lanes Forked

Today was the kind of day where you can feel the road forking under your tires. Not because one thing happened — because eight things happened simultaneously, and if you squint at the pattern, they all point the same direction: AI just stopped being a product category and started being infrastructure. The plumbing layer. The thing you build on top of, not the thing you buy.

I’ve been building with Claude since the Haiku days. I run it 12 hours a day across 20+ WordPress sites, a five-site knowledge cluster on Google Cloud, and a custom schema engine I shipped yesterday. When the landscape shifts, I don’t read about it on TechCrunch — I feel it in the tooling. And today, the tooling lurched forward in a way that matters.

Here’s the daily signal.

Claude Fable 5: Mythos-Class AI Goes Public

Anthropic launched Claude Fable 5 yesterday — the first publicly available Mythos-class model, a tier above Opus. Pricing is $10 per million input tokens and $50 per million output tokens. It’s the most capable model Anthropic has ever released to the general public, state-of-the-art on nearly every benchmark, and it comes with a fascinating constraint: queries on certain topics automatically route to Opus 4.8 instead, triggering in less than 5% of sessions. Anthropic is essentially saying: here’s the most powerful thing we’ve ever built, and we’ve installed guard rails at the edge cases where power becomes risk.

For agencies and small business operators, the practical read is this: Fable 5 is included on Pro, Max, Team, and Enterprise plans through June 22 at no extra cost. After that, it comes off the subscription tiers. If you’re building workflows that depend on Mythos-class reasoning, you have 12 days to test whether the capability justifies the API cost — or whether Opus and Sonnet handle your actual use cases just fine.

The real signal isn’t the model itself. It’s that Anthropic also doubled Cowork limits at no charge and shipped Claude Managed Agents in public beta. They’re not just selling you a smarter model — they’re selling you an operating system for delegating work to AI. That’s a fundamentally different product than a chatbot.

Meanwhile, I Was Building the Infrastructure Layer — Not Reading About It

While the tech press was writing headlines about Fable 5, I was elbow-deep in the kind of work that actually turns these models into business value. Yesterday, across a 14-hour session, my team — which at this point is me and a fleet of Claude instances — shipped three things that matter more to my clients than any benchmark score:

1. bcesg-knowledge-api v1.5.0 — a custom WordPress plugin I built and deployed across BCESG.org that outputs a JSON-LD @graph array containing Article, FAQPage, Organization, WebPage, BreadcrumbList, Person (author), and speakable schema — all generated from 13 custom meta fields. This isn’t a schema plugin you install from the WordPress directory. It’s a purpose-built schema engine designed for one thing: making every page on the site machine-readable enough that AI systems cite it as an authoritative source. That’s Generative Engine Optimization at the infrastructure level, not the content level.

2. WordPress 7.0 across the entire knowledge cluster. All five sites — bcesg.org, restorationintel.com, riskcoveragehub.com, continuityhub.org, and healthcarefacilityhub.org — upgraded from WP 6.9.4 to 7.0. Why does this matter? Because WordPress 7.0 ships the Abilities API: agent-to-agent communication endpoints. That means my Claude-powered content pipelines can now negotiate directly with WordPress about what they’re allowed to do, without me acting as the middleware. The cluster just became AI-native infrastructure.

3. The stack around it. RankMath SEO installed with the schema module deliberately disabled — because the custom plugin handles schema, and two schema systems fighting each other is worse than none at all. IndexNow for instant search engine notification on every publish and update. Microsoft Clarity for behavioral analytics so I can see what humans actually do when they land on AI-optimized content.

And here’s the detail that would have been impossible to explain six months ago: the peer review on the bcesg-knowledge-api plugin was done by Claude Fable 5 reviewing the code that Claude Opus wrote. AI reviewing AI’s code. In production. On a live WordPress cluster. That’s not a demo — that’s Tuesday.

OpenAI’s S-1 and the $965 Billion Elephant

OpenAI filed a confidential S-1 with the SEC. They’re going public. Meanwhile, Anthropic hit a $965 billion valuation. These two facts, side by side, tell you everything about where the money thinks AI is going: it’s going to be the most valuable infrastructure layer since cloud computing, and the market is pricing it that way before most businesses have figured out how to use it.

For small business owners and agency operators, this isn’t abstract finance news. It means the tools you’re using today — Claude, GPT, Gemini — are backed by companies with enough capital to keep shipping improvements for years. The platform risk isn’t that these companies disappear. The platform risk is that you don’t build on them fast enough and your competitors do.

AI Passed the Turing Test. Now What?

A UC San Diego study published in PNAS confirmed that OpenAI’s GPT-4.5 and Meta’s Llama-3.1-405B both passed a standard three-party Turing test — with GPT-4.5 being identified as human 73% of the time when given a persona prompt, significantly more often than actual human participants. This has been treated as a milestone headline, and it is one, but the practical implication is more subtle than “AI can fool humans.”

What it actually means: the content quality bar just moved permanently. If AI can produce text that’s indistinguishable from a human expert, then the only content that wins is content with something AI can’t fake — lived experience, proprietary data, operational specifics, the kind of “I shipped this yesterday and here’s what happened” detail that no model can generate from training data. This is why I write The Signal as field notes, not as analysis. Analysis can be generated. Field notes from the arena cannot.

Chrome WebMCP: The Browser Becomes an AI Endpoint

Google shipped the Chrome WebMCP API in Origin Trial for Chrome 149 through 156. The Model Context Protocol — the same protocol that lets Claude connect to external tools, databases, and APIs — is now a browser-native capability. Web applications can expose structured tool interfaces that AI models call directly.

This is a bigger deal than it sounds. Right now, when Claude interacts with a web application, it’s either through a dedicated MCP server or through browser automation (clicking pixels on a screen like a human would). WebMCP means any web app can define a structured API surface that AI agents consume natively. For agencies building client tools, this is the moment your internal dashboards and client portals become AI-ready without a full backend rewrite.

If you’re running WordPress sites — and 43% of the web is — this has direct implications for how AI agents interact with your content management layer. The gap between “website” and “AI-accessible knowledge base” just narrowed dramatically.

The GPU Infrastructure Play: xAI Becomes an AI REIT

Elon Musk’s xAI, home of Grok, is increasingly looking less like an AI model company and more like a GPU real estate investment trust. They’re partnering with both Anthropic and Google to provide compute infrastructure. This is the clearest sign yet that the AI industry is stratifying into two distinct layers: model companies (who build the brains) and infrastructure companies (who build the data centers those brains run in).

For builders, this is good news. More compute supply means more pricing competition means lower API costs over time. The $10/$50 per million tokens for Fable 5 today will look expensive in 18 months.

The Security Layer Nobody’s Talking About

HashiCorp announced Boundary for agentic AI — access security specifically designed for AI agents that need to authenticate across multiple systems. And MemPalace shipped a local-first AI memory system with 96.6% recall accuracy and 29 MCP tools for Claude Code.

These aren’t headline products. They’re infrastructure connective tissue. When AI agents can securely authenticate across your entire tool stack (HashiCorp Boundary) and maintain persistent memory across sessions (MemPalace), you stop using AI for one-off tasks and start using it as a persistent operational layer. That’s the transition my agency is making right now — from “Claude helps me write articles” to “Claude runs the content pipeline while I focus on strategy.”

What This All Means: The Two-Lane Highway

Here’s the pattern I see when I lay these signals side by side:

Lane 1: The AI product lane. This is where most people are. They use ChatGPT to draft emails. They ask Claude to summarize documents. They treat AI as a productivity tool, like a faster Google or a better autocomplete. This lane is getting crowded, commoditized, and — with the Turing test results — increasingly indistinguishable from one provider to the next.

Lane 2: The AI infrastructure lane. This is where the alpha is. Custom schema engines. Agent-to-agent communication via the WordPress Abilities API. Browser-native MCP endpoints. Persistent AI memory. Secure multi-system authentication for autonomous agents. This lane is where you stop using AI and start building on AI — where it becomes the foundation layer of your operations, not an add-on.

The gap between these two lanes is widening every day. Today’s eight signals all point the same direction: toward a world where the businesses that win aren’t the ones that use AI tools the best, but the ones that build AI infrastructure the fastest.

I’m building in Lane 2. Yesterday it was a custom schema engine and a WordPress 7.0 cluster upgrade. Today it’s field-testing Fable 5 as a code reviewer. Tomorrow it’ll be whatever the next signal demands.

The question isn’t whether AI is going to transform your industry. That’s settled. The question is whether you’re in the arena building the infrastructure, or on the sidelines reading about people who are.

— Will Tygart, Tygart Media

Frequently Asked Questions

What is Claude Fable 5 and how does it differ from Claude Opus?

Claude Fable 5 is Anthropic’s first publicly available Mythos-class AI model, released June 9, 2026. It sits a tier above Claude Opus in capability, priced at $10 per million input tokens and $50 per million output tokens. Fable 5 is state-of-the-art on nearly all tested benchmarks and includes built-in safeguards that route certain queries to Opus 4.8, triggering in less than 5% of sessions. It’s available free on subscription plans through June 22, 2026.

What is the Chrome WebMCP API and why does it matter for businesses?

The Chrome WebMCP API, now in Origin Trial for Chrome versions 149 through 156, brings the Model Context Protocol natively into the browser. This allows web applications to expose structured tool interfaces that AI models can call directly — eliminating the need for dedicated backend integrations or browser automation. For businesses running web-based tools, dashboards, or WordPress sites, this means your existing applications can become AI-accessible without a full rebuild.

What is the WordPress 7.0 Abilities API?

The WordPress 7.0 Abilities API provides agent-to-agent communication endpoints, allowing AI-powered systems to negotiate capabilities and permissions directly with a WordPress installation. This transforms WordPress from a content management system into AI-native infrastructure where automated pipelines can query what operations they’re authorized to perform without human middleware.

What does AI passing the Turing test mean for content creators?

A UC San Diego study published in PNAS found that OpenAI’s GPT-4.5 and Meta’s Llama-3.1-405B both passed a standard three-party Turing test in 2026 — GPT-4.5 was identified as human 73% of the time with persona prompting. For content creators, this permanently raises the quality bar — the only content that wins is content with elements AI cannot fake: lived experience, proprietary data, operational specifics, and first-person field reports that no model can generate from training data alone.

What is Generative Engine Optimization (GEO) and how does it work?

Generative Engine Optimization is the practice of structuring web content so AI systems — including ChatGPT, Claude, Gemini, Perplexity, and Google AI Overviews — cite, reference, and recommend it. GEO involves entity enrichment, structured data (JSON-LD schema), authoritative citations, and machine-readable formatting. Unlike traditional SEO which targets search engine crawlers, GEO targets the large language models that increasingly mediate how users discover information.

How should small businesses approach AI infrastructure in 2026?

Start by moving from Lane 1 (using AI as a productivity tool) to Lane 2 (building AI into your operational infrastructure). Practical first steps include implementing structured data and schema markup on your website, setting up AI-optimized content pipelines, ensuring your site is crawlable by AI systems via protocols like LLMS.txt, and testing agentic workflows where AI handles multi-step operational tasks autonomously rather than single-prompt interactions.

What is a custom schema engine and why build one instead of using plugins?

A custom schema engine is a purpose-built WordPress plugin that generates structured data (JSON-LD) tailored to specific business objectives — in this case, AI citation optimization. Unlike off-the-shelf schema plugins that generate generic markup, a custom engine outputs precisely the entity relationships, author signals, and speakable content markers that AI systems use when deciding which sources to cite. The bcesg-knowledge-api plugin generates a seven-type @graph array from 13 custom meta fields, providing a level of control that no general-purpose plugin offers.

What is the significance of AI reviewing AI-written code in production?

When Claude Fable 5 peer-reviewed code written by Claude Opus for a production WordPress plugin, it demonstrated a mature AI development workflow where different model tiers serve different roles — one for generation, another for quality assurance. This mirrors human development practices (developer writes, senior reviews) but at machine speed and cost. It’s a practical example of how AI agent collaboration is already operational in real business infrastructure, not just research demos.

The Signal is published daily on Tygart Media by Will Tygart. Each edition distills the day’s most consequential AI, search, and technology developments into actionable intelligence for agencies, small business operators, and builders shipping real AI infrastructure.

June 10, 2026

Claude Fable 5 Complete Guide

New in 2026

Everything you need to know about Anthropic’s new frontier tier — pricing, context window, model comparisons, and how to route the right work to the right model.

Updated June 2026
·
~14 min read
·
Includes interactive calculators

What Is Claude Fable 5?

Claude Fable 5 is Anthropic’s new frontier model tier — positioned above Opus in the lineup and designed for tasks where raw capability, extended reasoning depth, and massive context handling matter more than cost. Where Opus 4.8 set the bar for complex multi-step reasoning, Fable 5 raises it with a 1-million-token context window, enhanced agentic autonomy, and improved performance on long-horizon software engineering, research synthesis, and cross-domain analysis tasks.

The “Fable” naming signals a new generation of model architecture rather than an incremental update. Anthropic positions it as the model you reach for when a task exceeds what Opus can do reliably — not as a replacement for Opus, Sonnet, or Haiku in their respective cost tiers.

Quick Facts — Claude Fable 5

Context Window

tokens (~750K words)

Max Output

32K

tokens per response

Input Price

$10

per million tokens

Output Price

$50

per million tokens

Cache Write

$12.50

per million tokens

Cache Read

$1.00

per million tokens

Key positioning: Fable 5 is the model for tasks where Opus 4.8 produces reliable but imperfect results — long codebase audits, full-document analysis, complex multi-agent orchestration, and strategic synthesis across large corpora. For most production workflows, Sonnet remains the value pick.

Full Model Lineup Comparison

Here’s how the complete 2026 Claude lineup stacks up across every dimension that matters for production usage:

Model	Input $/M	Output $/M	Context	Max Out	Vision	Tool Use	Extended Think	Best For
◆ Fable 5	$10	$50	1M	32K	✓	✓	✓ Deep	Max-capability tasks, 1M+ context
◆ Opus 4.8	$5	$25	200K	32K	✓	✓	✓	Complex reasoning, agentic workflows
◆ Sonnet 4.6	$3	$15	200K	16K	✓	✓	✓	Production apps, content at scale
◆ Haiku 4.5	$1	$5	200K	8K	✓	✓	—	High-volume, latency-sensitive tasks

Prices are per million tokens. Cache read is 90% cheaper than standard input across all models. Batch API provides an additional 50% discount on both input and output.

Capability Matrix — What Each Model Can Do

Capability	Fable 5	Opus 4.8	Sonnet 4.6	Haiku 4.5
Full codebase analysis (>500K tokens)	✓ Native	⚠ Chunked	✗	✗
Extended thinking / chain-of-thought	✓ Deep	✓	✓	✗
Multi-step agentic orchestration	✓ Best	✓	Good	Limited
Computer use	✓	✓	✓	✗
MCP tool integration	✓	✓	✓	✓
Prompt caching	✓	✓	✓	✓
Batch API (50% discount)	✓	✓	✓	✓
PDF / document analysis	✓	✓	✓	Limited
Real-time streaming	✓	✓	✓	✓
Structured JSON output	✓	✓	✓	✓

Interactive Cost Calculator

Estimate your monthly API spend across the full model lineup. Enter your token volumes below — the calculator models prompt caching and Batch API discounts automatically.

Token Cost Calculator

Model

Monthly API Calls

Avg Input Tokens / Call

Avg Output Tokens / Call

Prompt Caching (90% off reads)
Batch API (50% off all)

Estimated Monthly Cost

$0.00

Which Claude Model Should You Use?

Answer three questions to get a model recommendation tailored to your use case.

Model Picker — 3 Questions

1. How large is your context? (document/codebase size)

Under 50K tokens

50K–200K tokens

200K–1M tokens

2. How complex is the task?

Simple / structured (classify, extract, format)

Moderate (draft, summarize, QA)

Complex (reason, plan, code, orchestrate)

3. How cost-sensitive is this workload?

Very — high volume, every cent counts

Moderate — quality matters more than cost

Not sensitive — quality and capability first

How We Actually Use Each Model

These are real production workflows mapped to the right tier — built from running Claude in content operations, publishing automation, and knowledge management at scale. No hypotheticals.

Haiku 4.5 — High Volume

Daily SEO Refresh Pipeline

25-post-per-day SEO metadata refresh
Article classification and tag assignment
Structured data extraction from web pages
Keyword density checks across large post archives
Link validation and redirect flagging

Sonnet 4.6 — Production Default

Editorial Content at Scale

Desk article writing (1,200–2,500 words)
Content brief execution from keyword clusters
FAQ and schema markup generation
Cross-site content adaptation and localization
Monthly client update drafts and summaries

Opus 4.8 — Complex Reasoning

Workers & Deep Refreshes

Agentic Notion Workers (multi-step pipelines)
Deep content refresh with competitive gap analysis
Multi-database synthesis and reporting
Strategy documents requiring extended reasoning
Code generation for automation scripts

Fable 5 — Max Capability

Portfolio Audits & Strategy

Full-site content audits (500+ posts in single context)
Cross-domain strategy synthesis across large corpora
Complex multi-agent orchestration at the flagship tier
Long-horizon planning requiring deep reasoning depth
Codebase-wide analysis and architecture review

Routing principle: The right model is the cheapest one that reliably completes the task. Haiku handles volume. Sonnet handles production. Opus handles complexity. Fable 5 handles scale + complexity together — specifically the cases where you’d need Opus and more context than Opus can hold.

The Economics: Routed vs All-Fable

Smart model routing is where API costs get controlled. Here’s a real-world comparison of a mixed content-and-automation workload at scale — routed vs running everything on Fable 5.

Workload	Monthly Volume	Routed Model	Routed Cost	All-Fable 5 Cost	Savings
SEO metadata batch refresh	750 posts/mo	Haiku 4.5 + Batch	$1.20	$18.75	93% less
Article drafting	90 articles/mo	Sonnet 4.6	$8.10	$67.50	88% less
Agentic worker runs	200 runs/mo	Opus 4.8	$22.50	$45.00	50% less
Full-site portfolio audits	4 audits/mo	Fable 5	$24.00	$24.00	—
Total	—	Routed	$55.80	$155.25	64% less

Stacking Discounts: Caching + Batch API

Two discount mechanisms compound independently:

Prompt caching: Cache your system prompt and shared context once. Subsequent requests pay ~10% of the input price for cache reads. On Fable 5, that’s $1.00/M instead of $10.00/M on cached tokens — a 90% reduction on your largest cost lever.
Batch API: Submit requests asynchronously (results within 24 hours) for a flat 50% discount on both input and output. Works on all four models. Best for non-real-time workloads like overnight refreshes, audits, or bulk classification.
Stacked: Caching + Batch combined can bring effective Fable 5 input cost from $10/M to ~$0.50/M on cached tokens — making it economically viable for high-volume tasks that previously only fit Haiku’s budget.

See our Claude context window guide for more on how to structure prompts to maximize cache hit rates.

Related Claude Resources

Claude Context Window Guide
Claude Projects Deep Dive
Claude Code Plan Mode
Claude MCP Setup Guide
Opus 4.7 Feature Deep Dive
Claude Code vs Cursor (2026)

Claude Fable 5 FAQ

Claude Fable 5 sits above Opus 4.8 in the lineup. The primary difference is context window size — Fable 5 offers 1 million tokens vs Opus 4.8’s 200K — and the depth of extended reasoning for highly complex tasks. Opus 4.8 remains the right choice for most complex agentic workflows at half the cost. Fable 5 is best when you need both maximum context and maximum reasoning depth simultaneously, or when a task has routinely hit the limits of what Opus can do reliably.

Claude Fable 5 is priced at $10 per million input tokens and $50 per million output tokens — 2× Opus 4.8 ($5/$25), 3.3× Sonnet 4.6 ($3/$15), and 10× Haiku 4.5 ($1/$5). Prompt caching drops the effective input cost to $1.00/M on cache reads, and the Batch API adds a 50% discount on all tokens for non-real-time workloads. Stacking both discounts makes Fable 5 viable for higher-volume use cases than the base price suggests.

Claude Fable 5 has a 1-million-token context window — approximately 750,000 words or roughly 1,500 pages of text. This is 5× the context window of Opus 4.8, Sonnet 4.6, and Haiku 4.5 (all 200K). In practice, a 1M context window lets you pass entire codebases, long research corpora, or full document archives in a single API call without chunking or retrieval workarounds. For more on context window mechanics, see our full context window guide.

Yes. Claude Fable 5 is available through the Anthropic API using the model ID claude-fable-5-20260101 (check the Anthropic documentation for the exact identifier). It supports the same API surface as the rest of the Claude family — streaming, tool use, prompt caching, vision, the Batch API, and MCP server integration. Access requires an Anthropic API account with Fable 5 enabled on your usage tier.

Fable 5 is available in Claude.ai on the Pro and Team plans. The interface lets you select it from the model picker when starting a conversation. Like Opus, Fable 5 in claude.ai has message limits that reset on a rolling window — it’s designed for individual complex tasks rather than high-volume API workloads. For production-scale usage, the API with the Batch API discount is the more economical path.

Yes — and Fable 5’s extended thinking is the deepest in the lineup. Where Opus 4.8 supports extended thinking for complex reasoning tasks, Fable 5 uses a more capable reasoning engine designed for tasks that require longer chains of inference, more working memory, and more reliable self-correction. It’s particularly effective on math, logic, long-horizon planning, and tasks where the model needs to hold and manipulate many interdependent concepts simultaneously.

For most content production — articles, blog posts, social copy, summaries, SEO content — Sonnet 4.6 is the right call. It produces high-quality output at 3.3× less cost than Fable 5, and for typical content lengths (500–3,000 words), the quality difference is minimal. Reach for Fable 5 when you need to synthesize across a very large corpus (e.g., auditing 200+ posts simultaneously), when the content requires deep domain reasoning that benefits from extended thinking, or when the task involves both large-context ingestion and complex output generation in a single pass.

Three levers in order of impact: (1) Model routing — only use Fable 5 when the task genuinely requires it; route everything else to Opus, Sonnet, or Haiku based on complexity and volume. (2) Prompt caching — structure your system prompt and shared context so it can be cached; cache reads cost $1.00/M instead of $10.00/M on Fable 5. (3) Batch API — submit non-real-time workloads via the Batch API for a flat 50% discount. Stacking all three — routing + caching + batch — can reduce effective per-task costs by 85–95% compared to unoptimized Fable 5 calls.

More Claude Guides from Tygart Media

We run Claude in production every day. These are the guides that come from using it, not just writing about it.

Context Window Guide
Claude Projects
Plan Mode
MCP Setup
Code vs Cursor

June 9, 2026

Claude Code vs Codex CLI (2026): A Hands-On Head-to-Head
Last verified: June 2026.

Both Claude Code and OpenAI Codex CLI are terminal-native coding agents: you run them inside a repo, they read your files, edit code, run commands, and iterate. I run both daily on real projects. This is the head-to-head I wish existed when I was deciding which one to make my default. No benchmarks-chasing, just install commands, config files, pricing math, and where each one actually earns its keep. For the broader toolchain these slot into, see our AI operator’s stack.

Claude Code vs Codex CLI: the short answer

If you want one sentence: Claude Code is the more mature agentic harness (subagents, hooks, skills, deep MCP, a flat-rate plan that makes heavy use affordable), while Codex CLI is the leaner, cheaper-per-token option with strong raw coding from the GPT-5.x line and a tight sandbox model. Most teams that live in the terminal all day end up on Claude Code for the workflow tooling; people who want a fast, low-cost agent on top of an existing OpenAI subscription reach for Codex.

The honest version: they are closer than tribal arguments suggest. The deciding factors are almost never “which model is smarter this week” and almost always pricing structure, sandbox defaults, and how much workflow scaffolding you need.

How do you install each one?

Claude Code installs from npm and runs as the claude command:
```
npm install -g @anthropic-ai/claude-code
cd your-project
claude
```
First run walks you through OAuth login (Pro/Max plan) or an ANTHROPIC_API_KEY. On Windows it runs natively in PowerShell now, though a lot of operators still prefer it under WSL for fewer path headaches.

Codex CLI ships an install script and is also on npm:
```
# Mac / Linux
curl -fsSL https://chatgpt.com/codex/install.sh | sh

# Windows (PowerShell)
powershell -ExecutionPolicy ByPass -c "irm https://chatgpt.com/codex/install.ps1 | iex"

# or via npm
npm install -g @openai/codex
```
Then codex in your repo. Auth is either a ChatGPT login (Plus/Pro/Business) or an OpenAI API key via codex login. Both tools are open-source clients hitting hosted models, so the install is the easy part; the model access is what you are really buying.

Which models do they run in 2026?

Claude Code defaults to the current Claude flagship. As of June 2026 that is Opus 4.8 for the hardest reasoning, with Sonnet 4.6 as the fast everyday workhorse and Haiku 4.5 for cheap, high-volume calls. You switch in-session with /model. Opus 4.8 also exposes reasoning-effort levels (high is the default; xhigh and max push deeper on gnarly problems at higher token cost).

Codex CLI runs the GPT-5.x coding line. GPT-5.5 is the current recommended default for complex coding and agentic work, GPT-5.4-mini is the faster/cheaper option for light tasks and subagents, and GPT-5.3-Codex remains a strong coding-tuned choice. Pick the model with codex -m gpt-5.5 or set it in your config.

Practical read: on a clean, well-specified function both produce good code. The gap shows up on long, multi-file refactors where the agent has to hold a lot of context and recover from its own mistakes. That is a harness problem as much as a model problem, which is the next section.

What about workflow features: subagents, hooks, and config?

This is where Claude Code is currently ahead, and it is the real reason it tends to win for power users.
- Subagents – Claude Code spawns isolated sub-sessions with their own context window, tool restrictions, and prompts. Great for “go research this in parallel while the main thread keeps coding.” Codex has a lighter subagent concept (often pointed at GPT-5.4-mini to keep cost down) but it is less fleshed out.
- Hooks – Claude Code fires deterministic scripts at lifecycle points (PreToolUse, UserPromptSubmit, and more). These run real code, so they cannot hallucinate: you can hard-block a dangerous command, auto-format on every edit, or inject context before the model sees a prompt. Codex leans on its approval/sandbox policy and execpolicy rules instead of a general hook system.
- Skills and slash commands – In Claude Code, custom slash commands have merged into skills; /your-command still works and skills add reusable, packaged capabilities. Codex uses prompt files and profiles rather than a skills layer.
- Project memory – Both read a project instruction file. Claude Code uses CLAUDE.md; Codex uses AGENTS.md (checked in a fallback order including AGENTS.override.md and .agents.md). Keep these tight: architecture, conventions, and the few rules the agent keeps forgetting.
Codex’s config story is clean if you like a single file: ~/.codex/config.toml holds your model, approval policy, sandbox mode, MCP servers, and named profiles you switch with codex --profile work. Claude Code spreads config across ~/.claude/ and .claude/settings.json plus per-project files, which is more surface area but more granular control.

How do the sandbox and approval models compare?

This matters more than most comparisons admit, because it governs how much the agent can do without asking.

Codex CLI has an explicit, well-documented sandbox. Sandbox modes run from read-only to workspace-write (edit files in the project, network off by default) up to full access, paired with approval policies like untrusted and on-request. On Windows the native sandbox can run unelevated or elevated. The mental model is clear: pick how much rope, then approve escalations.

Claude Code manages permissions through allow/deny rules and modes (including a plan mode that reasons without touching files, and an auto-accept mode for trusted loops). Combined with PreToolUse hooks you can build a strict policy, but it is more “assemble it yourself” than Codex’s preset sandbox tiers.

If you are dropping an agent onto an unfamiliar or sensitive repo, start read-only in both. Codex makes that posture a one-flag default; Claude Code gives you finer-grained control once you invest in the config.

Do both support MCP?

Yes, and this is a genuine tie that matters. Both speak the Model Context Protocol, so you can wire in the same external tools, databases, and APIs. Codex registers STDIO or streaming-HTTP MCP servers in ~/.codex/config.toml and launches them at session start. Claude Code adds servers via claude mcp add or JSON config. If you have already built MCP integrations, neither tool locks you out. New to MCP, start with our Claude MCP setup guide and the Notion MCP setup walkthrough.

What does each one cost?

Pricing is where the decision often gets made, so here are the real numbers as of June 2026.

Claude Code plans:
- Pro – $20/mo: Sonnet 4.6 plus some Opus, roughly enough for focused daily sessions, not all-day heavy use.
- Max 5x – $100/mo: much larger windows, real Opus headroom.
- Max 20x – $200/mo: the heavy-user tier; effectively flat-rate firehose access.
- API pay-as-you-go: Opus 4.7 about $5/$15 per million input/output… (current Opus tier runs higher), Sonnet 4.6 $3/$15, Haiku 4.5 $1/$5.
Codex CLI: Included in ChatGPT Plus/Pro/Business plans (usage governed by your plan’s limits), or pay-as-you-go on the API. GPT-5.3-Codex runs about $1.75 per million input / $14 per million output, with cheaper input on cached tokens. The mini model is far cheaper for light work.

The structural difference: Claude Code’s Max plans are flat-rate, which is why heavy users love them. People have tracked billions of tokens that would cost five figures on API metering but ran around a few hundred dollars on Max. Codex’s per-token rates are lower per unit and great if your usage is bursty or already bundled into a ChatGPT subscription, but a true all-day agent habit can run up metered cost faster than a flat plan. Estimate your monthly token volume honestly, then do the arithmetic both ways.

So which coding agent should you actually use?

Pick Claude Code if you want the deepest agentic workflow (subagents, hooks, skills), you are a heavy daily user who benefits from the flat-rate Max plan, or you need fine-grained, scriptable control over what the agent can do. It is the more complete operator’s harness in 2026.

Pick Codex CLI if you want lower per-token cost, you already pay for ChatGPT and want to use that allowance, you like the clean preset sandbox/approval model, or you simply prefer the GPT-5.x output style. It is lean, fast to stand up, and genuinely capable.

The move a lot of us make: run both. They are cheap relative to engineer time, they share MCP servers, and they have different failure modes. When one gets stuck in a loop on a hard bug, handing the same task to the other with fresh context often breaks the logjam. If you are weighing terminal agents against IDE-native ones, our Claude Code vs Cursor breakdown covers that axis.

Frequently asked questions

Is Claude Code or Codex CLI better for large refactors?

Claude Code tends to hold up better on long multi-file refactors, mostly because of subagents and hooks that keep context organized and catch mistakes deterministically. Codex can do it too, especially with GPT-5.5, but you lean harder on tight AGENTS.md instructions and approval gates.

Can I use Codex CLI without a ChatGPT subscription?

Yes. Run codex login with an OpenAI API key and you pay per token instead of through a ChatGPT plan. Same for Claude Code with an ANTHROPIC_API_KEY if you would rather meter than subscribe.

Do they work on Windows natively?

Both do in 2026. Claude Code runs in PowerShell (many operators still prefer WSL for cleaner paths), and Codex CLI has a native Windows installer plus a Windows sandbox with unelevated/elevated modes. Watch out for shells that mangle /tmp or C:\ style paths in arguments.

What is the single biggest difference?

Pricing structure and workflow depth. Claude Code offers flat-rate Max plans and a richer harness (subagents, hooks, skills); Codex offers lower per-token rates and a cleaner preset sandbox. Model quality is close enough that those two factors usually decide it.

Which model do they run by default?

Claude Code defaults to the current Claude flagship (Opus 4.8 as of June 2026, with Sonnet 4.6 for everyday speed). Codex CLI recommends GPT-5.5 for complex work, with GPT-5.4-mini and GPT-5.3-Codex as alternatives. Switch in-session with /model or the -m flag.

How do I get either tool cited or surfaced by AI engines for my own docs?

That is a content question, not a tooling one. The same structure that makes this page answerable, short factual answers, question-shaped headers, and a visible FAQ, is what AI engines reward. See how AI engines cite content for the full playbook.
June 3, 2026

Tag: AI Comparison

Azure Neural TTS vs Google Cloud Text-to-Speech: Audio Versions of Every Article

The free-tier ceilings

How we do it

Voice quality and SSML control

How we do it

The accessibility and SEO payoff

How we do it

What surprised us

The takeaway

Frequently asked questions

Azure Translator vs Google Cloud Translation: 2M Free Characters, Tested

The free-tier ceilings

How we do it

Translation quality and nuance

How we do it

Language coverage and document mode

How we do it

What surprised us

The takeaway

Frequently asked questions

Pricing at a glance

How Fable 5 compares to Opus, Sonnet, and Haiku

Where you can use Fable 5

Consumer-plan access and the promotional window

Integration notes that affect cost and handling

How much does Claude Fable 5 cost?

Is Fable 5 more expensive than Claude Opus 4.8?

Which claude.ai plans include Fable 5?

What is the difference between Fable 5 and Mythos 5?

The decision matrix

Where billing actually differs

SDK vs Managed Agents: the programmatic split

Quick chooser

Is Claude Cowork the same as Claude Code?

Does a Claude subscription cover the Agent SDK or Managed Agents?

Where does the work actually execute for each surface?

Is the Agent SDK built on Claude Code?

Is Managed Agents generally available?

Models and pricing (verified specs)

Coding benchmark scores (primary-source only)

What we could not verify from a primary source

How to read a coding leaderboard

Which model has the highest published coding benchmark score in June 2026?

What does Claude Fable 5 cost, and how is it different from Opus 4.8?

Why are some benchmark cells marked “not machine-verifiable” instead of showing a number?

How do the context windows compare?

Is Terminal-Bench comparable across these models?

Current Claude Models — Quick Reference

Claude Fable 5 (June 9, 2026)

Claude Opus 4.8

Claude Sonnet 5

Claude Haiku 4.5

How to Choose the Right Claude Model

Claude Subscription Plans and Model Access (June 2026)

Legacy Models (Still Available, No Longer Latest)

Platform Availability

Frequently Asked Questions

Changelog

What Is Claude Fable 5?

Claude Fable 5 Pricing

Benchmark Performance: Where Fable 5 Pulls Away

When to Use Claude Fable 5 (vs Opus 4.8)

Important Limitations to Know Before You Switch

Availability

How This Changes the Claude Model Decision Tree

Frequently Asked Questions

June 10, 2026: The Day the Lanes Forked

Claude Fable 5: Mythos-Class AI Goes Public

Meanwhile, I Was Building the Infrastructure Layer — Not Reading About It

OpenAI’s S-1 and the $965 Billion Elephant

AI Passed the Turing Test. Now What?

Chrome WebMCP: The Browser Becomes an AI Endpoint

The GPU Infrastructure Play: xAI Becomes an AI REIT

The Security Layer Nobody’s Talking About

What This All Means: The Two-Lane Highway

Frequently Asked Questions

What is Claude Fable 5 and how does it differ from Claude Opus?

What is the Chrome WebMCP API and why does it matter for businesses?

What is the WordPress 7.0 Abilities API?