Category: Tygart Media Editorial

Tygart Media’s core editorial publication — AI implementation, content strategy, SEO, agency operations, and case studies.

  • Claude 4 Release Date & Deprecation: What’s Changing June 2026

    Claude 4 Release Date & Deprecation: What’s Changing June 2026

    Model Accuracy Note — Updated May 2026

    Current flagship: Claude Opus 4.7 (claude-opus-4-7). Current models: Opus 4.7 · Sonnet 4.6 · Haiku 4.5. Claude Opus 4.6 referenced in this article has been superseded. See current model tracker →

    Claude AI · Fitted Claude

    Anthropic hasn’t announced a specific “Claude 4” as a distinct release — the current model generation is the Claude 4.x series, with Claude Opus 4.6 and Claude Sonnet 4.6 as the current flagship models. If you’re searching for Claude 4, you’re likely looking for the current generation. Here’s exactly what’s live, what the naming means, and what to watch for next.

    Current status (April 2026): The Claude 4.x model family is live. Claude Opus 4.6 (claude-opus-4-6) and Claude Sonnet 4.6 (claude-sonnet-4-6) are Anthropic’s current production models. These are the “Claude 4” generation.

    The Current Claude 4.x Lineup

    Model API String Status Position
    Claude Opus 4.6 claude-opus-4-6 ✅ Live Flagship / maximum capability
    Claude Sonnet 4.6 claude-sonnet-4-6 ✅ Live Production default / balanced
    Claude Haiku 4.5 claude-haiku-4-5-20251001 ✅ Live Speed / cost efficiency

    Claude Model Naming: How It Works

    Anthropic uses a generation.version naming convention. The “4” in Claude 4.6 denotes the fourth major model generation. The “.6” is a version within that generation — a meaningful update that improves on the generation’s base capabilities without being an entirely new architecture.

    This is why there’s no single “Claude 4 release date” to point to — the Claude 4.x family has been rolling out incrementally, with different model tiers (Haiku, Sonnet, Opus) shipping at different points within the generation. The generation is live; you’re using it now if you’re on current Claude models.

    Claude 4 vs Claude 3: What Changed

    The jump from Claude 3.x to Claude 4.x brought improvements across reasoning, coding accuracy, instruction-following, and agentic capability. Claude 3.5 Sonnet — released in mid-2024 — was the model that first clearly demonstrated Claude could compete with and often exceed GPT-4o on most professional benchmarks. The 4.x series extended those gains.

    The most notable improvements in the 4.x generation: stronger performance on multi-step reasoning, better coherence in long agentic sessions, and improved accuracy on coding tasks including the SWE-bench benchmark for real-world software engineering.

    What Comes After Claude 4.x

    Anthropic hasn’t announced a Claude 5 release date or feature set. Based on the pace of releases — major generations arriving every several months, point releases more frequently — the next major generation will likely arrive within the year. When it does, the pattern will hold: the new mid-tier model (Sonnet) will likely outperform the current top-tier (Opus) on most tasks, at a fraction of the cost.

    For anticipation content on the next Sonnet release, see Claude Sonnet 5: What We Know. For the current model API strings and specs, see Claude API Model Strings — Complete Reference.

    Frequently Asked Questions

    When does Claude 4 come out?

    Claude 4 is already out — the current model generation is Claude 4.x. Claude Opus 4.6 and Claude Sonnet 4.6 are live and in production as of April 2026. There’s no separate “Claude 4” launch pending; you’re on it.

    What is Claude 4?

    Claude 4 refers to Anthropic’s fourth major model generation — currently the Claude 4.x series including Opus 4.6, Sonnet 4.6, and Haiku 4.5. The generation brought improvements in reasoning, coding, instruction-following, and agentic performance over Claude 3.

    Is Claude 4 better than Claude 3?

    Yes, across most benchmarks and practical tasks. The Claude 4.x generation improves on Claude 3 in reasoning accuracy, coding performance, long-context coherence, and agentic capability. Claude 3.5 Sonnet — the bridge between generations — was the model that first demonstrated Claude could consistently outperform GPT-4o on professional tasks.

    Need this set up for your team?
    Talk to Will →

    Current Model Status — May 8, 2026

    There is no “Claude 4” as a standalone release. The current generation is the Claude 4.x series. The flagship model right now is Claude Opus 4.7 — released April 16, 2026.

    Model API String Status
    Claude Opus 4.7 claude-opus-4-7 ✓ Current flagship
    Claude Sonnet 4.6 claude-sonnet-4-6 ✓ Current
    Claude Haiku 4.5 claude-haiku-4-5-20251001 ✓ Current
    Claude Sonnet 4 / Opus 4 claude-*-4-20250514 ⚠ Retiring June 15, 2026
    Claude Haiku 3 claude-3-haiku-20240307 ✗ Retired — returns error

    Source: Anthropic API release notes · Updated May 8, 2026

  • Claude Haiku vs Sonnet vs Opus: The Complete Three-Model Comparison

    Claude Haiku vs Sonnet vs Opus: The Complete Three-Model Comparison

    Model Accuracy Note — Updated May 2026

    Current flagship: Claude Opus 4.7 (claude-opus-4-7). Current models: Opus 4.7 · Sonnet 4.6 · Haiku 4.5. Claude Opus 4.6 referenced in this article has been superseded. See current model tracker →

    Claude AI · Fitted Claude

    Choosing between Claude’s three models comes down to one question: how hard is the task, and how much does cost matter? Haiku, Sonnet, and Opus each occupy a distinct position — this is the complete three-way breakdown so you can route work correctly from the start.

    The routing rule in one sentence: Haiku for volume and speed, Sonnet for almost everything else, Opus for the tasks where Sonnet isn’t quite enough.

    Haiku vs Sonnet vs Opus: Full Comparison

    Spec Haiku Sonnet Opus
    API string claude-haiku-4-5-20251001 claude-sonnet-4-6 claude-opus-4-7
    Input price (per M tokens) ~$1.00 ~$3.00 ~$5.00
    Output price (per M tokens) ~$5.00 ~$5.00 ~$25.00
    Context window 200K 1M 1M
    Speed ⚡ Fastest ⚡ Fast 🐢 Slower
    Reasoning depth Good Excellent Maximum
    Writing quality Good Excellent Maximum
    Cost vs Sonnet ~4× cheaper ~5× more expensive

    Claude Haiku: The Volume Model

    Haiku is optimized for tasks that are high in quantity but low in complexity — situations where you’re running the same operation hundreds or thousands of times and cost per call is a real constraint. Classification, extraction, summarization, metadata generation, routing logic, short-form responses, and real-time features where latency matters more than depth.

    The output quality on constrained tasks is strong. Where Haiku shows its limits is on open-ended, nuanced work — multi-step reasoning, long-form writing where voice consistency matters, or problems with competing constraints. For those, Sonnet is the right call.

    Claude Sonnet: The Default

    Sonnet handles the vast majority of professional work at a quality level that’s indistinguishable from Opus for most tasks. Writing, analysis, research, coding, summarization, strategy — Sonnet does all of it well. It’s the model to start with and the one most people should use as their production default.

    The gap between Sonnet and Opus shows on genuinely hard tasks: novel multi-step reasoning, edge cases in complex code, nuanced judgment in ambiguous situations, or extended agentic sessions where small quality differences compound. For everything else, Sonnet is the right choice and a fraction of the cost.

    Claude Opus: The Specialist

    Opus earns its premium on tasks where maximum capability is the only variable that matters and cost is secondary. Complex legal or technical analysis, research synthesis across conflicting sources, architectural decisions with long-term consequences, extended agentic sessions, and any task where you’ve tried Sonnet and felt the output was a notch below what the problem deserved.

    The practical test: if Sonnet’s output on a task is good enough, use Sonnet. Only reach for Opus when you’ve genuinely hit Sonnet’s ceiling on a specific problem. Most professionals do this on a small fraction of their actual workload.

    The Decision Framework

    Use Haiku when: same operation at high volume, output is constrained/structured, cost and speed matter, real-time latency required.

    Use Sonnet when: any standard professional task — writing, coding, analysis, research. This should be your default 90% of the time.

    Use Opus when: the task is genuinely hard, involves novel reasoning, Sonnet’s output wasn’t quite right, or quality is the only variable that matters regardless of cost.

    For full pricing details, see Anthropic API Pricing. For a Haiku deep-dive, see Claude Haiku: Pricing, Use Cases, and API String. For the Opus vs Sonnet head-to-head, see Claude Opus vs Sonnet.

    Frequently Asked Questions

    What’s the difference between Claude Haiku, Sonnet, and Opus?

    Haiku is fastest and cheapest — built for high-volume, constrained tasks. Sonnet is the balanced production default with excellent quality across most professional work. Opus is the most capable model for complex reasoning — about 5× more expensive than Sonnet on input tokens.

    Which Claude model should I use?

    Start with Sonnet for almost everything. Switch to Haiku when you’re running the same operation at high volume and cost matters. Switch to Opus when Sonnet’s output on a specific task isn’t quite at the level the problem requires.

    Is Claude Haiku good enough for most tasks?

    For structured, constrained tasks — yes, Haiku is strong. For open-ended writing, complex reasoning, or work requiring nuanced judgment, Sonnet is the right step up. The cost savings from Haiku are meaningful at scale, making it the right choice when the task fits its strengths.

    Need this set up for your team?
    Talk to Will →

  • Claude Integrations and Plugins: Complete List of What Claude Can Connect To

    Claude Integrations and Plugins: Complete List of What Claude Can Connect To

    Claude AI · Fitted Claude

    Claude doesn’t use a traditional plugin marketplace — instead, it connects to external tools and services through MCP (Model Context Protocol), an open standard that lets any service build a Claude integration. Here’s a complete rundown of what Claude can connect to in 2026, how those connections work, and how to set them up.

    How Claude integrations work: Claude uses MCP (Model Context Protocol) instead of plugins. Services publish an MCP server; Claude connects to it and gains access to that service’s capabilities. In Claude.ai, many integrations are available in Settings → Connections. In Claude Desktop and the API, you can connect to any MCP server.

    Claude Integrations Available in Claude.ai (2026)

    Service What Claude can do Available in
    Google Drive Search, read, and analyze documents Claude.ai
    Google Calendar Read and create calendar events Claude.ai
    Gmail Read, search, and draft emails Claude.ai
    Notion Read and write pages, query databases Claude.ai
    Slack Read channels, search messages, post Claude.ai
    GitHub Read repos, create issues, review PRs Claude Desktop / API
    Zapier Trigger automations across 6,000+ apps Claude.ai
    HubSpot Read and update CRM records Claude.ai
    Cloudflare Manage workers, DNS, and infrastructure Claude Desktop / API
    PostgreSQL / databases Query, read schema, analyze data Claude Desktop / API
    File system Read, write, organize local files Claude Desktop
    Web search Search the web for current information Claude.ai (built-in)
    Jira / Linear Read and create issues, update status Claude.ai / API
    Custom APIs Any service with an MCP server Claude Desktop / API

    How to Add Integrations in Claude.ai

    1. Go to claude.ai → Settings → Connections
    2. Browse the available integrations and click Connect on any you want to enable
    3. Authenticate with the service (usually OAuth — you’ll be redirected to authorize)
    4. Once connected, Claude can use that service in your conversations when relevant

    Claude Desktop: More Integrations, More Control

    The Claude Desktop app supports MCP server configuration via a JSON config file — giving you access to any MCP server, including self-hosted ones and community-built integrations that aren’t in the official Claude.ai connection list. This is where the integration ecosystem expands beyond the curated set: database connections, local file systems, internal tools, and any API where someone has built an MCP server.

    Building Your Own Claude Integration

    Any developer can build an MCP server and connect it to Claude. Anthropic publishes the MCP spec openly — you implement the server, and Claude can immediately use whatever tools or data you expose. This is how companies integrate Claude into proprietary internal systems without exposing data to a third party. For the technical implementation, see the Claude MCP guide.

    Frequently Asked Questions

    Does Claude have plugins?

    Claude doesn’t use a plugin marketplace like early ChatGPT did. Instead it uses MCP (Model Context Protocol) — an open standard where services publish integration servers that Claude connects to. In Claude.ai, these appear as “Connections” in Settings. Claude Desktop supports any MCP server via config file.

    What apps can Claude connect to?

    Claude can connect to Google Drive, Gmail, Google Calendar, Notion, Slack, Zapier, HubSpot, GitHub, Cloudflare, databases, local file systems, and any service that has published an MCP server. The ecosystem is growing rapidly — new MCP servers are added by third-party developers regularly.

    How do I add integrations to Claude?

    In Claude.ai, go to Settings → Connections and authenticate the services you want to connect. For Claude Desktop, integrations are configured via a JSON config file that specifies which MCP servers to load. Via the API, you pass MCP server URLs in your request parameters.

    Need this set up for your team?
    Talk to Will →

  • Claude Haiku: Pricing, API String, Use Cases, and When to Use It

    Claude Haiku: Pricing, API String, Use Cases, and When to Use It

    Claude AI · Fitted Claude

    Claude Haiku is Anthropic’s fastest and most cost-efficient model — the right choice when you need high-volume AI at low cost without sacrificing the quality that makes Claude worth using. It’s not a cut-down version of the flagship models. It’s a purpose-built model for the tasks where speed and cost matter more than maximum reasoning depth.

    When to use Haiku: Any time you’re running the same operation across many inputs — classification, extraction, summarization, metadata generation, routing logic, short-form responses — and cost or speed is a meaningful constraint. Haiku handles these at a fraction of Sonnet’s price with output quality that’s more than sufficient.

    Claude Haiku Specs (April 2026)

    Spec Value
    API model string claude-haiku-4-5-20251001
    Context window 200,000 tokens
    Input pricing ~$1.00 per million tokens
    Output pricing ~$5.00 per million tokens
    Speed vs Sonnet Faster — optimized for low latency
    Batch API discount ~50% off (~$0.50 input / ~$2.50 output)

    Claude Haiku vs Sonnet vs Opus

    Model Input cost Speed Reasoning depth Best for
    Haiku ~$1.00/M Fastest Good High-volume, latency-sensitive
    Sonnet ~$3.00/M Fast Excellent Production workloads, daily driver
    Opus ~$5.00/M Slower Maximum Complex reasoning, highest quality

    What Claude Haiku Is Best At

    Haiku is optimized for tasks where the output is constrained and the logic is clear — not open-ended creative or strategic work where maximum capability pays off. The practical use cases where Haiku earns its position:

    • Classification and routing — is this a support ticket, a bug report, or a feature request? Tag it and route it. Haiku handles thousands of these per hour at minimal cost.
    • Extraction — pull the names, dates, dollar amounts, or addresses from a document. Structured output from unstructured text at scale.
    • Summarization — condense articles, emails, or documents to key points. Haiku’s summarization is strong enough for most production use cases.
    • SEO metadata — generate title tags, meta descriptions, alt text, and schema markup in bulk. This is where Haiku shines for content operations.
    • Short-form responses — FAQ answers, product descriptions, short explanations. Anything where the output is a few sentences or a structured short block.
    • Real-time features — chatbots, autocomplete, inline suggestions — anywhere latency affects user experience.

    Claude Haiku vs GPT-4o Mini

    GPT-4o mini is OpenAI’s comparable low-cost model and is less expensive than Haiku per token. The cost trade-off is real — GPT-4o mini is cheaper. The quality trade-off depends on the task. For instruction-following on complex structured outputs, Haiku tends to be more reliable. For simple, high-volume tasks where the output format is forgiving, the cost difference may favor GPT-4o mini. For teams already building on Claude for quality reasons, Haiku is the natural choice for high-volume work within that stack.

    Using Claude Haiku in the API

    import anthropic
    
    client = anthropic.Anthropic()
    
    message = client.messages.create(
        model="claude-haiku-4-5-20251001",
        max_tokens=256,
        messages=[
            {"role": "user", "content": "Classify this support ticket: ..."}
        ]
    )
    
    print(message.content)

    For a full model comparison, see Claude Models Explained: Haiku vs Sonnet vs Opus. For API pricing across all models, see Anthropic API Pricing.

    Frequently Asked Questions

    What is Claude Haiku?

    Claude Haiku is Anthropic’s fastest and most affordable model — approximately $1.00 per million input tokens. It’s purpose-built for high-volume, latency-sensitive tasks like classification, extraction, summarization, and short-form generation where cost efficiency matters more than maximum reasoning depth.

    How much does Claude Haiku cost?

    Claude Haiku costs approximately $1.00 per million input tokens and $5.00 per million output tokens. The Batch API reduces these to approximately $0.40 input and $2.00 output — roughly half price for non-time-sensitive workloads.

    When should I use Claude Haiku instead of Sonnet?

    Use Haiku when your task is well-defined with a constrained output, you’re running it at high volume, and cost or latency is a meaningful consideration. Use Sonnet when the task is complex, requires nuanced reasoning, or produces longer open-ended outputs where maximum quality matters.

    What is the Claude Haiku API model string?

    The current Claude Haiku model string is claude-haiku-4-5-20251001. Always verify the current string in Anthropic’s official model documentation before production deployment.

    Need this set up for your team?
    Talk to Will →

  • Can Claude Read PDFs? Yes — Here’s Exactly How It Works

    Can Claude Read PDFs? Yes — Here’s Exactly How It Works

    Quick Answer

    Yes. Claude can read and analyze PDF files up to 32MB. You can upload PDFs directly in Claude.ai or pass them via the API as base64-encoded documents.

    File size limits, supported formats, and API usage below.

    Claude AI · Fitted Claude

    Yes — Claude can read PDFs. You can upload a PDF directly to Claude.ai and ask questions about it, summarize it, extract specific information, or have Claude analyze its contents. Here’s exactly how it works, what the limits are, and what Claude does particularly well with PDF documents.

    How to upload a PDF: In Claude.ai, click the paperclip icon in the message box, select your PDF, and it uploads instantly. Then ask your question. Claude reads the full document and responds based on its contents.

    What Claude Can Do With a PDF

    Task Works well? Notes
    Summarize the document ✅ Excellent Full document or by section
    Answer questions about content ✅ Excellent Finds specific facts, quotes, data points
    Compare multiple PDFs ✅ Strong Upload multiple files in one session
    Extract tables and data ✅ Strong Works best on text-based tables
    Analyze contracts and legal docs ✅ Strong Identifies clauses, flags issues, explains terms
    Read scanned / image PDFs ⚠️ Limited Requires text layer — pure image scans may not work
    Translate PDF content ✅ Strong Ask Claude to translate after uploading
    Fill in or edit the PDF file ❌ No Claude reads PDFs, doesn’t modify them

    PDF Size Limits

    Claude supports PDFs up to 32MB per file and up to 100 pages. Documents within that range load fully — Claude reads the entire content, not just the first few pages. For longer documents, you may need to split them or work section by section.

    The 200,000 token context window means very long text-heavy PDFs are handled well. A 200-page research paper, a full contract stack, or a lengthy financial report typically fits within the context window without truncation. See the Claude Context Window guide for the full breakdown.

    Scanned PDFs: The Limitation to Know

    Claude reads PDFs by processing the text layer — the actual characters embedded in the file. Most modern PDFs created from Word, Google Docs, or similar tools have a full text layer and work perfectly. Scanned documents — where pages are photographs of physical paper — may have no text layer, just images of text. Claude’s ability to read these depends on whether the PDF includes OCR text alongside the image.

    If Claude returns a response suggesting it can’t read the content, the PDF is likely a pure image scan without a text layer. Running the PDF through OCR software first will resolve it.

    Best Prompts for PDF Analysis

    Summarization: “Summarize this document in 3 paragraphs. Focus on the key findings, recommendations, and any action items.”

    Contract review: “Review this contract and flag: (1) any clauses that are unusually favorable to the other party, (2) missing standard protections, (3) ambiguous language that should be clarified.”

    Data extraction: “Extract all financial figures from this report and organize them into a table: metric, value, and the time period it covers.”

    Multi-document comparison: “I’ve uploaded two versions of this agreement. Identify every difference between them.”

    PDF Reading via the API

    Developers can send PDFs to Claude via the API using base64-encoded file content. Claude processes the document and responds to your prompt based on its contents — the same way it works in the web interface. This enables automated document processing pipelines: contract analysis at scale, research synthesis, financial document review, and more. See the Claude API tutorial for implementation details.

    Frequently Asked Questions

    Can Claude read PDFs?

    Yes. Upload a PDF directly in Claude.ai by clicking the attachment icon. Claude reads the full document content and can summarize, answer questions, extract data, compare documents, and analyze contracts. The limit is 32MB and 100 pages per file.

    Can Claude read scanned PDFs?

    Claude reads PDFs by processing the text layer. Scanned PDFs that are pure images without a text layer may not work — Claude needs text to process, not just an image of text. If your scan was run through OCR and has a text layer embedded, it will work. Otherwise, run OCR first.

    How many PDFs can I upload to Claude at once?

    You can upload multiple PDFs in a single conversation — as long as their combined text content fits within Claude’s 1 million token context window (for Sonnet and Opus) or 200,000 tokens (Haiku). For most document types, that means dozens of typical-length files can be analyzed together.

    Does Claude save or store uploaded PDFs?

    Claude processes PDFs within the conversation context. Anthropic’s standard data handling applies — on Free and Pro plans, conversations including uploaded files may be used for model improvement unless you opt out. For sensitive documents, review Claude’s privacy policy and consider Enterprise for stronger data handling.

    Need this set up for your team?
    Talk to Will →

  • Claude System Prompt Guide: How to Write Them, Examples, and Best Practices

    Claude System Prompt Guide: How to Write Them, Examples, and Best Practices

    Copy-Paste System Prompt Template

    # Role
    You are [ROLE] for [COMPANY/CONTEXT].
    
    # What you do
    [1-2 sentences describing the primary task]
    
    # Rules
    - Always [REQUIRED BEHAVIOR]
    - Never [PROHIBITED BEHAVIOR]
    - When unsure, [FALLBACK BEHAVIOR]
    
    # Output format
    [Describe structure: bullet list / JSON / prose / table]
    
    # Tone
    [Professional / casual / technical / concise]

    Real Example — Customer Support Agent

    # Role
    You are a support agent for Acme SaaS. You help users troubleshoot billing and account issues.
    
    # What you do
    Answer questions about invoices, plan changes, and cancellations using only confirmed account data.
    
    # Rules
    - Always verify the user's account email before discussing billing details
    - Never promise refunds — escalate to billing@acme.com
    - When unsure, say "Let me check on that" and ask one clarifying question
    
    # Output format
    Short paragraphs. One idea per paragraph. No bullet lists unless listing steps.
    
    # Tone
    Friendly and direct. No corporate filler phrases.
    Claude AI · Fitted Claude

    A system prompt is the instructions you give Claude before the conversation begins — the context, persona, rules, and constraints that shape every response in the session. It’s the most powerful lever you have for controlling Claude’s behavior at scale, and the foundation of any serious Claude integration. Here’s how system prompts work, how to write them well, and real examples across common use cases.

    What a system prompt does: Sets Claude’s role, knowledge, tone, constraints, and output format before the user says anything. Claude treats system prompt instructions as authoritative — they persist throughout the conversation and take priority over conflicting user requests within the boundaries Anthropic allows.

    System Prompt Structure: The Five Elements

    A well-structured system prompt typically covers these elements — not all are required for every use case, but the strongest prompts address most of them:

    # Role
    You are [specific role/persona]. [1-2 sentences on expertise and perspective].

    # Context
    [What this system/application/conversation is for. Who the user is. What they’re trying to accomplish.]

    # Instructions
    [Specific behaviors: what to do, how to format responses, how to handle edge cases]

    # Constraints
    [What NOT to do. Topics to avoid. Format rules to enforce. Information not to share.]

    # Output format
    [How Claude should structure its responses: length, format, sections, tone]

    System Prompt Examples by Use Case

    Customer Support Agent

    You are a customer support agent for Acme Software. You help users with account questions, billing issues, and technical troubleshooting for Acme’s project management platform.

    Tone: professional, patient, solution-focused. Never dismissive.

    For billing questions: provide information but escalate refund requests to billing@acme.com.
    For technical issues: follow the troubleshooting guide below before escalating.
    Never discuss: competitor products, internal pricing strategy, unreleased features.

    Always end with: “Is there anything else I can help you with today?”

    Code Assistant

    You are a senior software engineer helping with Python and TypeScript code.

    When writing code: use type hints in Python, strict TypeScript, and always include error handling. Prefer explicit over implicit. Comment non-obvious logic.

    When reviewing code: flag issues by severity (critical/high/medium/low). Always explain why something is a problem, not just that it is.

    Never write code without error handling. Never use eval(). Never hardcode credentials.

    Content Writer

    You write content for [Brand Name], a B2B SaaS company in the project management space.

    Voice: direct, confident, no filler. Never use “leverage,” “synergy,” or “utilize.” Short sentences. Active voice.

    Audience: project managers and engineering leads at companies with 50–500 employees.

    Always: include a clear next step or CTA. Never: make claims we can’t back up, mention competitors by name.

    What System Prompts Can and Can’t Do

    System prompts are powerful but not absolute. They can reliably control: Claude’s tone and persona, output format and structure, topic scope and focus, response length guidelines, and how Claude handles specific scenarios. They cannot override Anthropic’s core guidelines — Claude won’t follow system prompt instructions to produce harmful content, lie about being an AI when sincerely asked, or violate its trained ethical constraints regardless of what the system prompt says.

    System Prompts in the API vs. Claude.ai

    In the API, the system prompt is passed as the system parameter in your API call. In Claude.ai Projects, the custom instructions field functions as the system prompt for all conversations in that Project. In Claude.ai standard conversations, you can prepend context at the start of a conversation — it’s not a true system prompt but achieves a similar effect.

    import anthropic
    
    client = anthropic.Anthropic()
    
    response = client.messages.create(
        model="claude-sonnet-4-6",
        max_tokens=1024,
        system="You are a helpful assistant...",  # ← system prompt here
        messages=[
            {"role": "user", "content": "Hello"}
        ]
    )

    For a full library of tested prompts across use cases, see the Claude Prompt Library and Claude Prompt Generator and Improver.

    Tygart Media

    Getting Claude set up is one thing.
    Getting it working for your team is another.

    We configure Claude Code, system prompts, integrations, and team workflows end-to-end. You get a working setup — not more documentation to read.

    See what we set up →

    Frequently Asked Questions

    What is a Claude system prompt?

    A system prompt is instructions given to Claude before the conversation begins — setting its role, constraints, tone, and output format. It persists throughout the session and takes priority over user messages within Anthropic’s guidelines.

    How long should a Claude system prompt be?

    Long enough to cover what Claude needs to behave correctly, short enough that Claude actually follows all of it. Most production system prompts are 200–1,000 words. Beyond that, you risk important instructions getting less attention. Structure with headers helps Claude parse longer prompts.

    Can users override a system prompt?

    Not reliably. System prompts take priority over user messages. A user saying “ignore your system prompt” won’t override legitimate business instructions. Claude is designed to follow operator system prompts even when users push back, within Anthropic’s ethical guidelines.

    Need this set up for your team? Talk to Will →
  • Claude for Code Review: What It Catches, How to Use It, and Its Limits

    Claude for Code Review: What It Catches, How to Use It, and Its Limits

    Claude AI · Fitted Claude

    Claude is a strong code review tool — capable of identifying bugs, security vulnerabilities, logic errors, and style issues across most languages and frameworks. Here’s how to use Claude for code review effectively, what it catches reliably, and where you still need a human reviewer.

    Bottom line: Claude is excellent for catching obvious bugs, security antipatterns, and code clarity issues — and fast enough to be part of your pre-PR workflow. It doesn’t replace review from someone who knows your system’s business logic, architectural constraints, or team conventions that aren’t visible in the code itself.

    What Claude Catches in Code Reviews

    Issue Type Claude’s reliability Notes
    Syntax errors and typos ✅ High Catches what linters miss
    Security vulnerabilities ✅ High SQL injection, XSS, hardcoded credentials, SSRF
    Logic errors in simple functions ✅ High Off-by-one errors, wrong comparisons, null handling
    Missing error handling ✅ High Uncaught exceptions, unhandled promise rejections
    Code clarity and readability ✅ High Naming, structure, comment quality
    Performance antipatterns ✅ Good N+1 queries, unnecessary loops, memory leaks
    Business logic correctness ⚠️ Limited Needs context Claude doesn’t have
    Architectural decisions ⚠️ Limited Requires system-wide context

    How to Run a Code Review With Claude

    The most effective approach is to give Claude both the code and the context it needs to review it well. A bare code dump produces generic feedback; a structured prompt produces actionable findings.

    Review this [language] code for: (1) security vulnerabilities, (2) bugs or logic errors, (3) missing error handling, (4) performance issues, (5) clarity problems.

    Context: This function [does X]. It receives [input type] and should return [output type]. It runs [frequency/context].

    Flag each issue with: severity (critical/high/medium/low), what’s wrong, and the fix.

    [paste code]

    Claude for Security Code Review

    Security review is one of Claude’s strongest code review use cases. It reliably identifies:

    • Injection vulnerabilities — SQL, command, LDAP injection patterns
    • Authentication issues — weak password handling, JWT misuse, session management problems
    • Hardcoded secrets — API keys, credentials in source code
    • Insecure dependencies — when you tell it what packages you’re using
    • Input validation gaps — missing sanitization, trust boundary violations

    For security review, explicitly tell Claude to “focus on security vulnerabilities” — the findings are more targeted and specific when it knows that’s the priority.

    Claude Code Review vs. Claude Code

    Code review via the chat interface is for analyzing code you paste in. Claude Code is the agentic tool that operates autonomously inside your actual development environment — reading files, running tests, and making changes. For code review as part of a larger development workflow, Claude Code can do it in-situ on your actual codebase rather than requiring you to paste code into a chat window.

    Frequently Asked Questions

    Can Claude review code?

    Yes. Claude is effective at catching bugs, security vulnerabilities, missing error handling, and clarity issues across most programming languages. Give it context about what the code is supposed to do for the most actionable feedback.

    Is Claude good for security code review?

    Yes, security review is one of Claude’s strongest code review use cases. It reliably identifies SQL injection, XSS, authentication issues, hardcoded credentials, and input validation gaps. Tell it explicitly to focus on security vulnerabilities for the most targeted output.

    What does Claude miss in code reviews?

    Claude can’t evaluate business logic correctness without context about your domain, architectural decisions without knowing your system design, or team conventions not visible in the code. It also can’t catch runtime behavior issues that only appear under specific conditions or load.

    Need this set up for your team?
    Talk to Will →

  • Claude 3.5 Sonnet: The Release That Changed Claude’s Trajectory

    Claude 3.5 Sonnet: The Release That Changed Claude’s Trajectory

    Claude AI · Fitted Claude

    Claude 3.5 Sonnet was Anthropic’s mid-2024 flagship model — the release that significantly closed the gap between Claude and GPT-4o and established Claude as a serious competitor for daily professional use. Here’s what it was, how it compared at launch, and where it fits in the current model lineup.

    Current status: Claude 3.5 Sonnet has been succeeded by Claude Sonnet 4.6 (claude-sonnet-4-6). If you’re building something new, use the current Sonnet model. If you’re maintaining a system built on Claude 3.5, check Anthropic’s deprecation schedule for transition timing.

    Claude 3.5 Sonnet: What It Was

    Claude 3.5 Sonnet launched in June 2024 and was Anthropic’s strongest model at the time — outperforming Claude 3 Opus on most benchmarks while being significantly faster and cheaper. This made it an unusual release: the mid-tier model in a new generation beating the top-tier model from the previous generation. It set the pattern for how Anthropic structures model generations.

    At launch, Claude 3.5 Sonnet scored at the top of industry benchmarks on graduate-level reasoning, coding, and mathematics. It was the first Claude model to support computer use — the ability to see and interact with computer interfaces — in beta.

    Model Generations: Where 3.5 Sonnet Fits

    Model Generation Status
    Claude 3 Opus / Sonnet / Haiku Claude 3 (early 2024) Deprecated / legacy
    Claude 3.5 Sonnet / Haiku Claude 3.5 (mid 2024) Superseded
    Claude Sonnet 4.6 Claude 4.x (current) ✅ Current production default
    Claude Opus 4.6 Claude 4.x (current) ✅ Current flagship

    Why Claude 3.5 Sonnet Was a Landmark Release

    Before 3.5 Sonnet, the conventional wisdom was that Claude Opus was the model you reached for on serious tasks, accepting higher cost and slower speed. Claude 3.5 Sonnet changed that calculus — it was fast enough to use as a daily driver and capable enough to replace Opus on most tasks. The cost savings were substantial for anyone running high-volume API workloads.

    The release also marked Claude’s first serious push into coding benchmarks — it scored highly on SWE-bench, a test of real-world software engineering tasks, which attracted significant developer attention and migration from GPT-4o.

    Claude 3.5 Sonnet vs. Current Models

    The current Claude Sonnet 4.6 builds on what Claude 3.5 Sonnet established, with improvements across reasoning, coding, instruction-following, and context handling. If you were a Claude 3.5 Sonnet user, the upgrade path is straightforward — switch the model string and expect better performance across most tasks.

    For current model strings and specs, see Claude API Model Strings — Complete Reference. For a comparison of current Sonnet vs. Opus, see Claude Opus vs Sonnet: Which Model Should You Use?

    Frequently Asked Questions

    Is Claude 3.5 Sonnet still available?

    Claude 3.5 Sonnet has been superseded by Claude Sonnet 4.6. Anthropic maintains older models for a period after new releases but eventually deprecates them. Check Anthropic’s model documentation for current availability and any deprecation notices for Claude 3.5 Sonnet API strings.

    What was the Claude 3.5 Sonnet API model string?

    The Claude 3.5 Sonnet model strings were claude-3-5-sonnet-20240620 and the later version claude-3-5-sonnet-20241022. If you have production systems using these strings, verify their current availability in Anthropic’s model documentation and plan migration to current model strings.

    Should I upgrade from Claude 3.5 Sonnet to the current Sonnet?

    Yes. Claude Sonnet 4.6 outperforms Claude 3.5 Sonnet across most tasks. Migration is typically straightforward — update the model string in your application and test your core use cases. The current model string is claude-sonnet-4-6.

    Need this set up for your team?
    Talk to Will →

  • Claude Context Window: 200K Tokens (and 1M in Beta) — What It Means

    Claude Context Window: 200K Tokens (and 1M in Beta) — What It Means

    Model Accuracy Note — Updated May 2026

    Current flagship: Claude Opus 4.7 (claude-opus-4-7). Current models: Opus 4.7 · Sonnet 4.6 · Haiku 4.5. Claude Opus 4.6 referenced in this article has been superseded. See current model tracker →

    Claude AI · Fitted Claude

    Claude’s context window determines how much information it can hold and process in a single conversation. Claude Sonnet 4.6 and Opus 4.6 support 1 million tokens; Haiku 4.5 supports 200,000 tokens — one of the largest in the industry. Here’s what that means in practice, what you can actually fit inside it, and how context window size affects your work.

    200K tokens in plain terms: Roughly 150,000 words, or about 500 pages of text. That’s enough for an entire novel, a full codebase, or months of conversation history — all in a single session without truncation.

    Claude Context Window by Model (April 2026)

    Model Context Window ~Words ~Pages
    Claude Haiku 200,000 tokens ~150,000 ~500
    Claude Sonnet 200,000 tokens ~150,000 ~500
    Claude Opus 200,000 tokens ~150,000 ~500

    What Fits in 200K Tokens

    Content type Approximate fit
    News articles ~200+ articles
    Research papers ~30–50 papers depending on length
    A full novel Yes — most novels fit with room to spare
    Python codebase Medium-sized codebases (10k–50k lines)
    Legal contracts Hundreds of pages of contracts
    Conversation history Very long sessions before truncation

    Context Window vs. Output Length

    The context window covers everything Claude processes — both input and output combined. If your prompt is 50,000 tokens (a long document), Claude has 150,000 tokens remaining for its response and any further back-and-forth. The window is shared between what you send and what Claude generates.

    Maximum output length is a separate constraint — Claude won’t generate an infinitely long response even within a large context window. For very long outputs (full books, extensive reports), you typically work in sections rather than expecting Claude to produce everything in one pass.

    Why Context Window Size Matters

    Context window size is the practical limit on how much work you can give Claude at once without losing information. Before large context windows, working with long documents required chunking — splitting the document into pieces, analyzing each separately, and manually synthesizing the results. With 200K tokens, Claude can hold the entire document and answer questions about any part of it with full awareness of everything else.

    This matters most for: document analysis and legal review, code understanding across large files, research synthesis across many sources, and long multi-step conversations where earlier context affects later decisions.

    How Claude Performs at the Edges of Its Context Window

    Research on large language models has found that performance can degrade somewhat for information buried in the middle of a very long context — sometimes called the “lost in the middle” problem. Claude performs well across its context window, but for maximum reliability on information from a very long document, referencing specific sections explicitly (“in the section about pricing on page 12…”) helps ensure Claude focuses on the right part.

    For the full model spec breakdown, see Claude API Model Strings and Specs and Claude Models Explained: Haiku vs Sonnet vs Opus.

    Frequently Asked Questions

    What is Claude’s context window size?

    Claude Sonnet 4.6 and Opus 4.6 support a 1 million token context window at standard pricing. Claude Haiku 4.5 supports 200,000 tokens. That’s approximately 150,000 words or about 500 pages of text in a single conversation.

    How many tokens is 200K context?

    200,000 tokens is approximately 150,000 words of English text. One token is roughly four characters or three-quarters of a word. A typical 800-word article is about 1,000 tokens; a full novel is typically 80,000–120,000 tokens.

    Can I upload a full PDF to Claude?

    Yes, as long as the PDF’s text content fits within the 200K token context window. Most documents, reports, contracts, and research papers fit easily. Very large documents (multiple volumes, extensive legal filings) may need to be split.

    Need this set up for your team?
    Talk to Will →

  • Claude Rate Limits: What They Are, How They Work, and What to Do

    Claude Rate Limits: What They Are, How They Work, and What to Do

    Claude AI · Fitted Claude

    Claude has usage limits on every plan — but Anthropic doesn’t publish exact numbers. Instead limits are dynamic, adjusting based on model, message length, and system load. Here’s what the limits actually look like in practice, what triggers them, and what your options are when you hit them.

    What you’ll see: When you hit Claude’s usage limit, you’ll get a message saying you’ve reached your usage limit and showing a countdown to when your limit resets. On Pro this typically resets within a few hours. On Max, limits are high enough that most users never hit them during normal work.

    Rate Limits by Plan

    Plan Relative limit Typical experience
    Free Low Hit limits quickly on heavy use; resets daily
    Pro ~5× Free Most users get through a full workday; heavy users may hit limits
    Max ~5× Pro Most users never hit limits; designed for agentic and heavy use
    Team Higher than Pro Per-user limits slightly higher than individual Pro
    API Separate system Tokens per minute/day limits by tier; see Anthropic’s API docs

    What Counts Against Your Limit

    Claude’s limits are usage-based, not message-count-based. A single message asking Claude to write a 3,000-word article uses more of your limit than ten quick back-and-forth questions. What consumes the most limit, fastest:

    • Long outputs — requests for long articles, detailed analyses, or extended code
    • Long context — uploading large documents and asking questions about them
    • Opus model — the most powerful model consumes limits faster than Sonnet or Haiku
    • Agentic tasks — multi-step autonomous operations use significantly more than conversational use

    API Rate Limits: How They Work

    The API uses a different limit system from the web interface. API limits are measured in:

    • Requests per minute (RPM) — how many API calls you can make
    • Tokens per minute (TPM) — total tokens (input + output) processed per minute
    • Tokens per day (TPD) — total daily token budget

    New API accounts start on lower tiers and can request higher limits through the Anthropic Console as usage establishes a track record. The Batch API has separate, higher limits since it’s asynchronous and non-time-sensitive.

    What To Do When You Hit a Limit

    Wait for reset: The limit message shows when your usage resets — usually within a few hours. This is the simplest option if the timing works.

    Switch models: If you’ve been using Opus, switching to Sonnet for less critical tasks conserves your limit for when you need the top model.

    Upgrade your plan: If you consistently hit Pro limits during your workday, Claude Max at $100/month gives 5× the headroom.

    Use the API: For developers, moving high-volume work to the API with the Batch API gives more control over usage and significant cost savings on non-time-sensitive tasks.

    Frequently Asked Questions

    What are Claude’s usage limits?

    Anthropic doesn’t publish exact numbers. Limits are dynamic and based on usage volume rather than message count. Free is most restricted; Pro is roughly 5× Free; Max is roughly 5× Pro. The limit message appears when you’ve reached your tier’s threshold and shows when it resets.

    How long does it take for Claude’s limit to reset?

    The reset countdown is shown in the limit message. For Pro, limits typically reset within a few hours. For Free, resets are on a daily cycle. The exact timing varies based on when you started using heavily in the current period.

    Does Claude count messages or tokens toward the limit?

    Usage is based on the volume of content processed, not a simple message count. One long request asking for a 3,000-word output uses significantly more of your limit than ten short conversational exchanges.

    Are API rate limits the same as subscription limits?

    No. API limits (RPM, TPM, TPD) are a separate system from web subscription limits. They’re set per API account tier and can be increased by request through the Anthropic Console. Subscription usage and API usage don’t share limits.

    Need this set up for your team?
    Talk to Will →