Claude Extended Thinking: When and How to Use It

Q: Does extended thinking always give better answers?

No. It improves complex reasoning tasks but adds latency. Standard mode is faster for simple tasks.

Last verified: June 13, 2026

Claude AI · Fitted Claude

Extended thinking is Claude’s most powerful reasoning mode — and the one most people never use correctly. This guide explains what extended thinking does, when it genuinely improves outputs, how to enable it, and when you’re better off with a standard prompt.

What Is Extended Thinking?

Extended thinking gives Claude a dedicated reasoning phase before generating its final response. Claude works through a problem on “scratch paper” before writing its answer — exploring multiple approaches, identifying errors in its own reasoning, and building a more deliberate chain of thought. On current Claude models (Fable 5, Opus 4.8, Sonnet 4.6), this works through adaptive thinking — Claude dynamically decides when and how much to think based on problem complexity, instead of you setting a fixed token budget.

When Extended Thinking Genuinely Helps

Complex math and logic problems requiring step-by-step reasoning
Multi-step coding tasks with many interdependent components
Strategic analysis requiring weighing many variables
Difficult research synthesis where accuracy matters most
Any task where “think step by step” would help — extended thinking does this automatically

When Extended Thinking Is Overkill

Simple factual questions with clear answers
Routine writing tasks (emails, summaries, short copy)
Format conversion or data transformation
Tasks where speed matters more than depth

How to Enable Extended Thinking

In Claude.ai: Look for the thinking toggle before sending your message.

Via API (current models — Fable 5, Opus 4.8, Sonnet 4.6): Use adaptive thinking and let Claude decide the depth:

"thinking": { "type": "adaptive" },
"output_config": { "effort": "high" }

The effort parameter — low, medium, high, xhigh, or max — controls how much Claude thinks and how many tokens it spends. Default is high. Use xhigh for coding and agentic work; low for fast, simple tasks. (xhigh is available on Fable 5 and Opus 4.7+; max on Opus-tier 4.6+ and Sonnet 4.6.)

⚠ budget_tokens is deprecated. The old "thinking": {"type": "enabled", "budget_tokens": N} form returns a 400 error on Fable 5, Opus 4.7, and Opus 4.8. It is deprecated but still functional on Opus 4.6 and Sonnet 4.6. Only legacy models (e.g. Sonnet 4.5) still require it, where budget_tokens must be less than max_tokens. For all new code, use adaptive thinking plus effort instead.

What You See During Extended Thinking

In claude.ai, Claude shows a collapsed “thinking” section before its response. Expand it to see the reasoning chain — useful for verifying logic or understanding how Claude approached a problem. The thinking section is exploratory and may contain dead ends; this is normal.

Via API, on Fable 5, Opus 4.7, and Opus 4.8 the thinking text is omitted by default (the blocks stream but their text is empty). To see summarized reasoning, request "thinking": {"type": "adaptive", "display": "summarized"}.

Frequently Asked Questions

Does extended thinking always give better answers?

No. It improves accuracy on complex reasoning tasks but adds latency. For simple tasks, standard mode is faster and just as accurate.

Need this set up for your team?
Talk to Will →

What to explore next

Anthropic

What Is Claude Trained On? Training Data, Methods, and Cutoff Dates

Same room

Anthropic

Claude Updates May–June 2026: Opus 4.8, SpaceX Compute, Managed Agents Memory, and What’s Coming Next

Same room

Agency Playbook

Notion Client Portal Setup for Agencies: How We Build Ours

You may also explore

Deep dive

The Lab

The Knowledge Compression Project: Can a Song Teach Faster Than Prose?

Deep dive

Track the AI tools you actually use

Live, vendor-neutral prices & limits for ChatGPT, Claude, Gemini, Perplexity and more — and we’ll email you the moment your tools change price or limits. Free, no hype.

See the live AI tracker →or set up your alerts

Claude Extended Thinking: When and How to Use It

What Is Extended Thinking?

When Extended Thinking Genuinely Helps

When Extended Thinking Is Overkill

How to Enable Extended Thinking

What You See During Extended Thinking

Frequently Asked Questions

Does extended thinking always give better answers?

Comments

Leave a Reply Cancel reply

More posts

AI Agents Are Learning to Check Instead of Guess: The GitHub Context Problem

Logic Apps vs Cloud Workflows: No-Code Automation Across Two Clouds

Azure Static Web Apps vs Firebase Hosting: A Dashboard on Each

Cosmos DB vs Firestore: A Free-Tier Operations Ledger on Both Clouds