Author: Will Tygart

  • Military Families at PSNS: How Belfair’s 2026 Housing Market Stretches Your BAH Further Than Silverdale

    Military Families at PSNS: How Belfair’s 2026 Housing Market Stretches Your BAH Further Than Silverdale

    If you’re a military family stationed at Naval Base Kitsap — PSNS Bremerton or Bangor — and you’re comparing Belfair to Silverdale or Bremerton for your next home, the 2026 numbers tell a clear story. Belfair’s median home price of $405,000 sits well below Kitsap County equivalents, and for families stretching BAH, that gap means the difference between renting and owning.

    The BAH Math: Mason County vs. Kitsap County

    Belfair falls under Mason County BAH rates, which are lower than Kitsap County rates. On paper, this looks like a disadvantage. In practice, it often isn’t — because Belfair housing costs are proportionally even lower than the BAH difference.

    A junior enlisted family (E-4 with dependents) receiving Mason County BAH can rent a 3-bedroom home in Belfair and pocket the difference, or use the savings toward a purchase. The same family in Silverdale would need to supplement BAH from base pay to cover equivalent housing. For E-5 through E-7 families, the gap is even more pronounced — Belfair ownership becomes realistic where Silverdale ownership requires significant out-of-pocket.

    What $350,000-$450,000 Gets a Military Family in Belfair

    In the sweet spot for military families — $350,000-$450,000 — Belfair delivers:

    • 3-4 bedroom single-family homes on 0.5-1.5 acres
    • Space for vehicles, boats, and outdoor equipment that base housing doesn’t allow
    • Yards large enough for kids and pets
    • Privacy and quiet that Silverdale apartments and townhomes can’t match

    The same budget in Silverdale gets you a 2-bedroom condo or a dated townhome. In Bremerton, a smaller house on a fraction of the lot.

    The Commute Tradeoff — And the 2026 Wrinkle

    The savings come with SR-3. From Belfair to PSNS: 30-50 minutes under normal conditions. From Belfair to Bangor: 45-60 minutes. This is real drive time on a two-lane highway that doesn’t have a backup route.

    In summer 2026 specifically, SR-3 will be fully closed for up to 16 days near Gorst for a fish barrier removal project. The detour adds 15-40 minutes. If you’re PCSing to the area mid-2026, factor this into your transition timeline. See the full SR-3 closure breakdown.

    Schools and Family Life

    North Mason School District serves about 2,800 students. North Mason High School has strong athletics and AP offerings. The district is smaller than Central Kitsap or South Kitsap, which means smaller class sizes but fewer specialized programs. Military kids integrate well — North Mason has a steady population of PSNS and Bangor families, so your kids won’t be the only ones who moved from out of state.

    Youth activities center around North Mason community organizations, the Theler Wetlands environmental programs, and school-based sports. It’s not Silverdale’s strip-mall convenience, but families who prefer outdoor-oriented communities often prefer it.

    VA Loans and Well/Septic

    VA loans work in Belfair, but the well and septic requirement adds a step. VA appraisers require satisfactory well water testing and septic inspection. Budget extra time in your closing timeline — Mason County inspections can take 2-4 weeks. If the septic fails VA requirements, the seller typically negotiates repair or replacement before closing.

    Related Coverage

    Read our full 2026 Belfair real estate analysis and the military families in Belfair guide for more on base proximity, BAH specifics, and family life in North Mason.

    Frequently Asked Questions

    Is Belfair cheaper than Silverdale for military families?

    Yes. Belfair’s median home price of $405,000 is significantly below comparable Silverdale properties. A 3-bedroom home on an acre in Belfair costs what a 2-bedroom condo costs in Silverdale. Military families consistently report that BAH stretches further in Belfair despite the lower Mason County rate.

    Can I use a VA loan to buy in Belfair?

    Yes. VA loans work in Belfair, but most properties use well water and septic systems, which require additional VA appraisal steps — well water testing and septic inspection. Budget 2-4 extra weeks in your closing timeline for Mason County inspections.

    How far is Belfair from PSNS and Bangor?

    PSNS Bremerton is 30-50 minutes from Belfair via SR-3 under normal conditions. Naval Submarine Base Bangor near Silverdale is 45-60 minutes. Both commutes use SR-3, which will face a 16-day closure in summer 2026.

    Are North Mason schools good for military kids?

    North Mason School District is smaller than Central Kitsap or South Kitsap (about 2,800 students), offering smaller class sizes and a community feel. The district has a steady military family population from PSNS and Bangor, so transition support for incoming families is routine.


  • Hood Canal Property Owners: What Belfair’s 2026 Market Means for Your Waterfront Investment

    Hood Canal Property Owners: What Belfair’s 2026 Market Means for Your Waterfront Investment

    If you already own waterfront property on Hood Canal near Belfair — or you’re seriously looking at a waterfront purchase — the 2026 market has specific implications that don’t apply to inland buyers. Tidelands, septic regulations, shoreline management, and the waterfront premium all create a separate buying and ownership calculus.

    The Waterfront Premium in 2026

    Direct Hood Canal waterfront in the Belfair area ranges from $700,000 for modest cottages to $1.5 million+ for newer homes on 2+ acres with mountain views. The most premium properties — deep water moorage, deeded tidelands, newer bulkheads — can exceed $2 million.

    Compared to Belfair’s overall median of $405,000, you’re paying a 75-275% premium for water access. The question isn’t whether the premium exists — it’s whether the hidden costs erode the investment value.

    Tidelands: The Ownership Layer Most Buyers Miss

    In Washington State, tidelands ownership is separate from upland property ownership. When you buy a “waterfront” home near Belfair, you may or may not own the tidelands — the area between ordinary high water and extreme low tide. This distinction matters enormously:

    • Shellfish harvesting: If you own deeded tidelands, you have private shellfish rights on your beach. Hood Canal is one of the most productive shellfish areas in Washington. Without tidelands ownership, your beach access may be limited to recreation only.
    • Dock permits: Building or maintaining a dock requires tidelands ownership or a DNR aquatic lands lease. The permitting process through Mason County and the Army Corps of Engineers takes 6-18 months.
    • Property value: Deeded tidelands add $50,000-$150,000+ to a property’s value compared to waterfront without tidelands.

    Septic Systems: The Regulatory Tightening

    Hood Canal’s marine environment is classified as sensitive. Septic systems within 200 feet of the shoreline face stricter monitoring requirements from Mason County Environmental Health. If your system fails inspection, replacement costs range from $20,000-$50,000+ for shoreline-compliant advanced treatment systems — significantly more than standard inland septic replacement.

    The county has been increasing enforcement of septic inspection requirements during property transfers. Budget accordingly if you’re selling or buying in 2026.

    Shoreline Management Act: What You Can and Can’t Do

    Hood Canal waterfront properties in Mason County fall under Washington’s Shoreline Management Act. Setback requirements, vegetation buffers, and construction restrictions apply within 200 feet of the ordinary high-water mark. Want to build a deck, expand your home, or remove trees for a better view? Each requires a shoreline permit through Mason County, and the buffer requirements may surprise you.

    Insurance and Ongoing Costs

    Waterfront ownership near Belfair typically adds $3,000-$8,000 annually beyond mortgage costs: flood insurance ($1,500-$5,000), bulkhead maintenance, septic monitoring, and higher property insurance rates for structures near water. Factor these into your investment return calculation.

    Related Coverage

    Read our full 2026 Belfair real estate analysis for inland pricing and neighborhood breakdowns, and the 2026 Hood Canal shellfish season guide for current harvesting rules.

    Frequently Asked Questions

    How much does Hood Canal waterfront cost near Belfair in 2026?

    Direct Hood Canal waterfront near Belfair ranges from approximately $700,000 for modest cottages to $1.5 million+ for newer homes with mountain views and deep water access. Properties with deeded tidelands command a premium of $50,000-$150,000+ over comparable waterfront without tidelands.

    What are tidelands and should I care when buying Hood Canal waterfront?

    Tidelands are the area between ordinary high water and extreme low tide. In Washington, tidelands ownership is separate from upland property. Owning deeded tidelands gives you private shellfish harvesting rights, dock building eligibility, and increased property value. Always verify tidelands status during due diligence on any Hood Canal waterfront purchase.

    How much does flood insurance cost for Hood Canal waterfront in Belfair?

    Flood insurance for Hood Canal waterfront properties near Belfair typically costs $1,500-$5,000+ annually depending on your property’s elevation, structure type, and FEMA flood zone classification. This is in addition to standard homeowner’s insurance.

    Can I build a dock on Hood Canal waterfront property near Belfair?

    Dock construction requires tidelands ownership or a DNR aquatic lands lease, plus permits from Mason County and the Army Corps of Engineers. The permitting process takes 6-18 months and must comply with Washington’s Shoreline Management Act. Not all properties qualify.


  • Belfair Real Estate in 2026: What the Numbers Actually Say About Buying in North Mason

    Belfair Real Estate in 2026: What the Numbers Actually Say About Buying in North Mason

    Belfair’s real estate market in 2026 sits at a crossroads. Median home values have climbed to approximately $405,000 — higher than Mason County’s $352,000 median — while average listing prices for the 37 active properties hover around $502,000. For anyone looking to buy in North Mason, the gap between what you’ll see online and what you’ll actually pay reveals a market with more nuance than the headline numbers suggest.

    The Price Reality: What $400K-$500K Gets You in Belfair

    A typical single-family home in the $400,000-$475,000 range sits on 0.5 to 1.5 acres, features 3 bedrooms, and was built between 1990 and 2010. You’re getting space that doesn’t exist at this price point in Kitsap County. But you’re also getting a well and septic system, propane or oil heat, and a 30-40 minute commute to Bremerton.

    The $300,000-$400,000 tier exists but it’s thin. These are typically older homes (1970s-1980s) on smaller lots, sometimes needing significant updates. They sell fast because they’re the entry point for first-time buyers and military families stretching BAH.

    The $500,000-$700,000 tier gets you newer construction, larger acreage (2-5 acres), or partial water views. This is where Hood Canal proximity starts appearing in listings without direct waterfront access.

    Hood Canal Waterfront: The Premium Tier

    Direct Hood Canal waterfront in the Belfair area commands $700,000 to $1.5 million+, with exceptional properties exceeding $2 million. These aren’t just homes — they’re lifestyle purchases. Views of the Olympic Mountains across the canal, private beach access, kayak launches from your yard.

    The hidden costs are real: waterfront septic systems near sensitive marine environments face stricter regulation. Flood insurance, shoreline setback requirements, and maintenance on bulkheads or natural shoreline add $3,000-$8,000 annually beyond your mortgage. Tidelands ownership — whether you own the beach below the high-water mark — varies by property and significantly affects what you can do with your waterfront.

    Neighborhood Breakdown: Where People Actually Live

    Central Belfair / SR-3 Corridor: The most convenient location for shopping, dining, and SR-3 access. Homes here tend to be on smaller lots (0.25-0.75 acres) and closer together. This is where you’ll find the most affordable options and the easiest daily errands. Walking distance to Safeway, the post office, and the Belfair Town Center development.

    North Shore / Hood Canal Side: Properties along NE North Shore Road and tributaries offer canal views or proximity. Quieter, more rural feel. Larger lots. You’ll trade convenience for scenery — the nearest grocery store is a 10-15 minute drive.

    Belfair-Allyn Road Corridor: Running southwest toward Allyn, this stretch offers larger parcels and newer subdivisions. Good for families wanting acreage and newer schools access. The commute to Bremerton adds 5-10 minutes versus central Belfair.

    Tahuya / Dewatto Direction: South and west of Belfair, these unincorporated areas offer the most land for the least money. Five-acre parcels under $400,000 exist here. But you’re 20+ minutes from Belfair’s services on winding rural roads with no cell service in places.

    Market Dynamics: Slow Inventory, Steady Demand

    Belfair’s market isn’t frenzied like suburban Seattle, but it’s not soft either. Most properly priced homes sell within 30-45 days. With only ~37 active listings at any given time, inventory turns slowly. You won’t have 50 options to tour — more like 8-12 that match your criteria.

    Demand drivers remain consistent: PSNS and Bangor civilian/military employees seeking affordable alternatives to Kitsap County, remote workers escaping Seattle metro prices, and retirees attracted to Hood Canal’s beauty and Mason County’s lower property taxes.

    The Infrastructure Factor

    Every real estate decision in Belfair connects to SR-3. The Belfair Bypass delay means the commercial corridor remains the only route north. If you’re buying based on the bypass improving traffic by 2028, recalibrate — current projections push it to 2033 at the earliest.

    Well and septic are standard outside central Belfair. Budget $5,000-$15,000 for a septic inspection and potential repair/replacement at closing. Wells should be tested for flow rate, bacteria, and nitrates — Mason County Health Department has specific requirements.

    Related Belfair Bugle Coverage

    See our original Belfair real estate overview, the complete guide to living in Belfair, and Tahuya & Dewatto rural living guide for neighborhood-specific details.

    Frequently Asked Questions

    What is the median home price in Belfair Washington in 2026?

    The median home value in Belfair is approximately $405,000 as of 2026, compared to Mason County’s overall median of $352,000. Average active listing prices run higher at around $502,000, reflecting the mix of waterfront and premium properties on the market.

    How does Belfair real estate compare to Silverdale or Bremerton?

    Belfair homes are significantly more affordable per square foot than Silverdale or Bremerton. A 3-bedroom home on an acre in Belfair at $425,000 would cost $550,000-$650,000+ in Silverdale. The tradeoff is a longer commute and well/septic instead of municipal water and sewer.

    Do I need flood insurance for a Hood Canal waterfront property in Belfair?

    Most Hood Canal waterfront properties in the Belfair area fall within FEMA flood zones requiring flood insurance. Premiums vary significantly — $1,500 to $5,000+ annually depending on elevation, structure type, and proximity to the waterline. Get a flood determination before making an offer.

    What are tidelands and do they matter when buying waterfront in Belfair?

    Tidelands are the area between the ordinary high-water mark and extreme low tide. In Washington State, tidelands ownership is separate from upland ownership. Some Belfair waterfront properties include deeded tidelands; others don’t. This affects shellfish harvesting rights, dock permits, and beach access. Always verify tidelands ownership during due diligence.

    Is Belfair a good investment for rental property?

    Belfair has steady rental demand from PSNS/Bangor workers and families who want North Mason’s affordability without buying immediately. Rental vacancy rates are low. However, well/septic maintenance responsibilities fall on the landlord, and Mason County’s rural infrastructure means higher maintenance costs than urban rentals.

    What should I budget for well and septic when buying in Belfair?

    Budget $5,000-$15,000 for septic inspection and potential repairs at closing. Well testing (flow rate, bacteria, nitrates) costs $300-$600. If a septic system needs full replacement, costs range from $15,000-$40,000+ depending on soil conditions and system type. Mason County Health Department inspections are required for most property transfers.


  • Moving to Belfair for PSNS? What the 2026 SR-3 Construction Means Before You Sign a Lease

    Moving to Belfair for PSNS? What the 2026 SR-3 Construction Means Before You Sign a Lease

    If you’re PCSing to Naval Base Kitsap or starting a civilian job at PSNS and considering Belfair as your home base, the 2026 road construction picture is something you need to understand before signing a lease or making an offer. Belfair’s affordability is real — but so is the SR-3 commute reality.

    Why People Choose Belfair Despite the Commute

    Belfair sits at the southern tip of Hood Canal in Mason County, about 30-40 minutes from PSNS under normal conditions via SR-3. The draw is straightforward: homes in Belfair cost significantly less than Bremerton or Silverdale. A family can rent a 3-bedroom house in Belfair for what a 2-bedroom apartment costs in Silverdale. If you’re stretching BAH or a civilian salary, that math matters.

    The tradeoff is a single-road commute. SR-3 is the only practical route between Belfair and Bremerton. There is no highway alternative, no parallel interstate, no backup route. When SR-3 has problems, every Belfair commuter feels them.

    What’s Happening to SR-3 in 2026

    Three things are converging this year:

    • A 16-day full closure near Gorst for fish barrier removal. No through traffic. Detour through rural roads adds 15-40 minutes depending on time of day.
    • A new roundabout at the SR-3/SR-16 Spur intersection in Gorst, with months of construction-related lane restrictions.
    • The Belfair Bypass has been delayed. The 6-mile alternate route that was supposed to start construction in 2026 has been pushed to the 2031-33 funding cycle by the Governor’s budget.

    What This Means If You’re Deciding Now

    Belfair is still a strong choice for many PSNS and Bangor families. The housing savings are substantial — potentially $500-$800/month less than comparable homes in Silverdale. But go in with your eyes open:

    • Your commute will be disrupted during the summer 2026 closure. If you’re arriving mid-year, you’ll hit it immediately.
    • The Belfair Bypass isn’t coming until at least 2033. Don’t factor it into your housing decision.
    • Winter commutes on SR-3 are the real test. Ice near Gorst, limited visibility, and accident-prone stretches mean 40-minute drives can become 90-minute ordeals from November through March.

    If you’re on day shift at PSNS and your partner works in Silverdale or Poulsbo, Belfair may add too much combined commute time. If one spouse works from home or you’re on a flexible schedule, the savings work.

    Getting Oriented in North Mason

    Before you commit, drive the route yourself during a weekday morning — not a weekend. See what 6:30 AM SR-3 through Gorst actually feels like. Check out our complete guide to living in Belfair and the full SR-3 construction breakdown for detailed timing and detour routes.

    Frequently Asked Questions

    How long is the commute from Belfair to PSNS Bremerton?

    Under normal conditions, 30-50 minutes via SR-3 depending on your neighborhood and time of day. During the summer 2026 SR-3 closure, add 15-40 minutes via detour routes. Winter conditions can add 20-30 minutes on bad days.

    Is Belfair worth the commute for PSNS workers?

    For families prioritizing affordable housing, space, and a quieter community, yes. A typical Belfair home costs $405,000-$475,000 — significantly less than Silverdale or Bremerton. The tradeoff is a single-road commute with seasonal and construction-related delays.

    When will the Belfair Bypass reduce commute times?

    The SR-3 Freight Corridor (Belfair Bypass) received federal environmental approval in 2024 but funding has been delayed to the 2031-33 biennium. Realistically, don’t expect it before 2033-2035.

    What’s the BAH situation for military families in Belfair?

    Belfair falls under Mason County BAH rates, which are lower than Kitsap County. However, housing costs in Belfair are proportionally lower, so many military families find their BAH stretches further here than in Silverdale or Bremerton despite the lower rate.


  • PSNS Workers: How the Summer 2026 SR-3 Closure Affects Your Belfair Commute and What to Do About It

    PSNS Workers: How the Summer 2026 SR-3 Closure Affects Your Belfair Commute and What to Do About It

    If you work at Puget Sound Naval Shipyard and live in Belfair or anywhere along the SR-3 corridor, the summer 2026 road closure is going to hit your commute hard. Here’s what PSNS-specific workers need to plan for — shift by shift, gate by gate.

    The Closure: What PSNS Workers Specifically Face

    SR-3 near Gorst will be completely closed for up to 16 consecutive days this summer for fish barrier removal. For the roughly 14,000 civilian and military employees who pass through PSNS gates daily, thousands of whom live in North Mason, this is not a minor inconvenience — it’s a commute overhaul.

    The detour route through Sunnyslope Road Southwest to Lake Flora Road was designed for rural traffic, not shift-change surges. If 500+ PSNS commuters from Belfair and points south hit this detour simultaneously at 6:15 AM, the road will bottleneck.

    Shift-by-Shift Impact Assessment

    Day shift (6-7 AM departure from Belfair): Heaviest impact. The detour adds 15-25 minutes under light conditions, but during the closure, expect 30-40 minutes additional as the narrow detour road handles concentrated volume. Leave by 5:30 AM to maintain your gate arrival time.

    Swing shift (2-3 PM departure): Moderate impact. You’ll hit the detour with fewer vehicles, but returning home after 11 PM means driving unfamiliar rural roads in the dark. Sunnyslope Road has limited lighting.

    Graveyard shift (10-11 PM departure): Lightest traffic impact, but the same dark-road concern applies. The detour route has no streetlights for most of its length.

    Gate Access During Construction

    PSNS gate procedures won’t change during the SR-3 closure — the closure is south of Bremerton, not at the base. But if thousands of workers arrive late simultaneously, expect longer gate queues as security processes the backlog. Contact your supervisor about flexible arrival windows if your role allows it.

    Carpooling and Alternative Strategies

    The Navy Region Northwest rideshare board has historically connected Belfair-area PSNS commuters. During the closure, carpooling isn’t just convenient — it directly reduces the number of vehicles on a detour road that can’t handle full volume. Three workers in one vehicle means two fewer cars on Lake Flora Road.

    Some PSNS workers from North Mason have historically used the Bremerton ferry as an alternative, but this only works if you live closer to the Hood Canal Bridge corridor. For Belfair residents, the detour is your reality.

    Related Coverage

    Read the full SR-3 closure breakdown for all detour routes, roundabout construction details, and the Belfair Bypass delay. Also see our complete Belfair-to-PSNS commute guide.

    Frequently Asked Questions

    How much longer will my Belfair-to-PSNS commute be during the SR-3 closure?

    Under normal detour conditions, add 15-25 minutes. During the 6-7 AM PSNS shift change surge, expect 30-40 minutes additional as Sunnyslope Road and Lake Flora Road handle concentrated commuter volume not designed for those roads.

    Should I change my PSNS shift during the SR-3 closure?

    If your role allows shift flexibility, swing or graveyard shifts face lighter detour traffic. Discuss options with your supervisor before the closure begins. Day shift workers from Belfair will bear the heaviest impact.

    Is there a way to avoid the SR-3 detour from Belfair to PSNS?

    For Belfair residents, the Sunnyslope/Lake Flora detour is the primary route. There is no practical alternative that avoids the closure area entirely without adding 45+ minutes via SR-302 and SR-16.

    Will PSNS adjust gate procedures during the SR-3 closure?

    PSNS gate security operates independently of road construction. However, concentrated late arrivals may create longer queues at primary gates. Plan to arrive earlier than usual to account for both the detour and potential gate delays.


  • SR-3 Closure, Gorst Roundabout, and the Belfair Bypass Delay: What Every North Mason Commuter Needs to Know in 2026

    SR-3 Closure, Gorst Roundabout, and the Belfair Bypass Delay: What Every North Mason Commuter Needs to Know in 2026

    If you drive SR-3 between Belfair and Bremerton, 2026 is going to test your patience. Three overlapping infrastructure projects — a 16-day full road closure near Gorst, a new roundabout at the SR-3/SR-16 Spur intersection, and the politically uncertain Belfair Bypass — will reshape how North Mason residents get to PSNS, Bangor, and everywhere south of Gorst. Here’s what’s actually happening, when, and what it means for your daily drive.

    The 16-Day SR-3 Closure: Fish Barrier Removal Near Gorst

    WSDOT’s fish barrier removal project on SR-3, SR-16, and SR-166 near Gorst will require a complete closure of SR-3 for up to 16 consecutive days during summer 2026. Crews will remove a section of the highway near Sunnyslope Road Southwest and install a new 150-foot-long box culvert to restore fish passage.

    This is not a lane restriction. This is a full road closure — no through traffic on SR-3 at that location for over two weeks.

    Early work starts in April 2026 with nighttime lane closures at two locations for utility relocations and limited vegetation removal. The 16-day closure itself is scheduled for summer, though WSDOT has not yet locked the exact dates.

    Detour Routes During the SR-3 Closure

    WSDOT has published three signed detour routes:

    • Passenger vehicles: Sunnyslope Road Southwest to Southwest Lake Flora Road
    • Pedestrians, cyclists, and those who roll: Northeast Old Belfair Highway to West Belfair Valley Road
    • Commercial vehicles: SR-16 to SR-302 (a significantly longer route)

    For PSNS commuters leaving Belfair at 6 AM, the Sunnyslope/Lake Flora detour adds approximately 15-25 minutes depending on traffic volume. During shift changes — particularly the 7 AM gate surge — expect these detour roads to carry far more traffic than they were designed for.

    The New Gorst Roundabout

    As part of the same project, WSDOT will construct a new roundabout at the intersection of SR-3, SR-16 Spur, and West Sam Christopherson Avenue. This intersection has been an accident cluster point for decades, and the roundabout is designed to reduce collision potential and improve traffic flow.

    For daily commuters, the roundabout should eventually smooth the stop-and-go pattern that defines Gorst. But during construction, expect lane shifts, temporary signals, and reduced speeds through the area.

    The Belfair Bypass: Delayed or Dead?

    The SR-3 Freight Corridor — commonly known as the Belfair Bypass — was a 6-mile new alignment designed to route regional through-traffic around Belfair’s commercial corridor rather than through it. The Federal Highway Administration issued a Finding of No Significant Impact (FONSI) in November 2024, and construction was originally planned to begin in spring 2026 with completion by 2028.

    Then Governor Bob Ferguson’s proposed transportation budget pushed the project’s funding to the 2031-33 biennium. As reported by the Mason County Journal in February 2026, this delay could push the bypass back by five years or more.

    For North Mason commuters, this means the Belfair commercial corridor — SR-3 through town — remains the only route. The 18,000+ daily vehicle count through Belfair’s main stretch will continue growing without relief.

    What This Means for Your Daily Drive

    If you commute from Belfair to PSNS or Bangor:

    • Plan now for the 16-day closure. If your shift schedule allows flexibility, consider adjusting during the closure window. Carpooling through the detour reduces vehicle volume on roads not built for this traffic.
    • The Sunnyslope/Lake Flora detour is narrow. These are rural roads. Two large trucks passing in opposite directions will slow everything down.
    • Gorst roundabout construction will overlap. Even after the 16-day closure ends, expect reduced capacity through Gorst for months as the roundabout is built.
    • The Belfair Bypass is not coming soon. Don’t make housing or commute decisions based on the bypass being operational by 2028. The current political reality suggests 2033 at the earliest.

    Related Belfair Bugle Coverage

    For more context on commuting from North Mason, see our complete guide to commuting from Belfair to PSNS, our military families in Belfair guide, and the latest commuter alert.

    Frequently Asked Questions

    When exactly will SR-3 be fully closed near Gorst in 2026?

    WSDOT has confirmed the closure will last up to 16 consecutive days during summer 2026. Early utility work begins in April 2026 with nighttime lane closures. The exact summer closure dates have not been finalized — check WSDOT’s SR-3 project page for updates.

    What is the best detour route from Belfair to PSNS during the SR-3 closure?

    For passenger vehicles, WSDOT’s signed detour uses Sunnyslope Road Southwest to Southwest Lake Flora Road. This adds approximately 15-25 minutes to a typical Belfair-to-Bremerton commute depending on traffic volume during the closure.

    Is the Belfair Bypass still being built in 2026?

    The SR-3 Freight Corridor (Belfair Bypass) received federal environmental approval in November 2024, but Governor Ferguson’s proposed transportation budget delays construction funding to the 2031-33 biennium. Construction originally planned for spring 2026 is now unlikely before 2033.

    Will the new Gorst roundabout help PSNS commuters from Belfair?

    Yes, long-term. The roundabout at SR-3, SR-16 Spur, and West Sam Christopherson Avenue replaces a collision-prone intersection. Once completed, it should reduce stop-and-go delays through Gorst. During construction, expect temporary lane shifts and reduced speeds.

    How many vehicles use SR-3 through Belfair daily?

    SR-3 through Belfair’s commercial corridor carries more than 18,000 vehicles per day. Without the Belfair Bypass, this volume will continue increasing as the North Mason population grows.

    What is the Gorst fish barrier removal project?

    WSDOT is removing fish passage barriers on SR-3, SR-16, and SR-166 near Gorst. The project includes installing a 150-foot-long box culvert on SR-3 near Sunnyslope Road Southwest, which requires the 16-day full road closure, plus building a new roundabout to improve safety.


  • Task Budgets, xhigh, and the 2,576px Vision Ceiling: Opus 4.7’s Most Interesting Features Explained

    Task Budgets, xhigh, and the 2,576px Vision Ceiling: Opus 4.7’s Most Interesting Features Explained

    Last refreshed: May 15, 2026

    Model Accuracy Note — Updated May 2026

    Current flagship: Claude Opus 4.7 (claude-opus-4-7). Current models: Opus 4.7 · Sonnet 4.6 · Haiku 4.5. Claude Opus 4.7 (claude-opus-4-7) is the current flagship as of April 16, 2026. Where this article references Opus 4.6 or earlier models, those references are historical. See current model tracker →. See current model tracker →

    What this article covers

    Three features in Opus 4.7 deserve their own explanation because they change what’s actually possible in daily work, not just what’s bigger on a benchmark chart:

    1. Task budgets (beta) — per-subtask ceilings that tame agent cost variance.
    2. The xhigh effort level — the new reasoning-control setting between high and max.
    3. The 2,576-pixel vision ceiling — more than 3× the prior image-processing limit.

    Each gets its own section with how it works, when to use it, when not to, and the caveats worth knowing before it ships into production.


    Feature 1: Task budgets (beta)

    What it is. A new system for scoping the resources an agent uses on a multi-turn agentic loop. Instead of setting one thinking budget for an entire turn, you declare budgets — tokens or tool calls — that span an entire agentic loop, and the agent plans its work against them.

    The problem it solves. Agent runs have notoriously high cost variance. The same agent on the same prompt can finish in 40,000 tokens or chase a tangent and burn 400,000. Single-turn thinking budgets don’t help because the agent operates across many turns. Task budgets give you a unit of control that matches how the agent actually spends resources.

    How the agent uses them. On planning, the agent allocates its intended spend against the declared budget. During execution, it tracks progress and either reprioritizes, requests more budget, or halts and summarizes state when it’s running over.

    Behavior note: budgets are soft, not hard. The agent is nudged to respect them, not hard-cut. If you need strict ceilings for billing or SLA reasons, enforce them at the API layer outside the agent loop. Task budgets are for behavior shaping, not hard resource limiting.

    When to use them.
    – Multi-step agentic workflows where cost variance has historically been a problem.
    – Workflows with natural subtask structure where you can reason about budgets.
    – Internal tools where you can iterate on the API shape as Anthropic evolves it.

    When not to use them.
    – Simple single-turn requests. Task budgets are overhead that doesn’t pay off on short interactions.
    – Production contracts that are painful to version. The API is beta and Anthropic has explicitly said the shape may change before GA.
    – Workflows where you need provable hard cutoffs. Enforce those at the API layer, not via this feature.

    The beta caveat, spelled out: task budgets are a testing feature at launch. Parameter names and shape may change. Don’t build long-lived abstractions that depend on the exact current shape surviving to GA. Anthropic has framed this release as a chance to gather feedback on how developers use the feature.


    Feature 2: The xhigh effort level

    What it is. A new setting for reasoning effort, slotted between high and max. Opus 4.6 had three levels: low, medium, high. Opus 4.7 adds xhigh, making four: low, medium, high, xhigh, plus max at the top.

    Why it exists. Anthropic’s framing in the release materials: xhigh gives users “finer control over the tradeoff between reasoning and latency on hard problems.” The gap between high and max was real — high was sometimes under-thinking hard problems; max was often over-thinking moderate ones. xhigh smooths the curve by giving you a setting that’s more thoughtful than high without the runaway token budget of max.

    Anthropic’s own guidance. “When testing Opus 4.7 for coding and agentic use cases, we recommend starting with high or xhigh effort.” That’s a direct recommendation to make xhigh part of your default rotation for serious work, not a niche escalation.

    How to use it.
    – Keep high as the default for routine work.
    – Use xhigh as the new first-choice escalation when high isn’t quite getting there — or start there for coding and agentic tasks per Anthropic’s recommendation.
    – Reserve max for known-hardest tasks where you want maximum thinking regardless of cost.

    Important tradeoff. Higher effort levels in 4.7 produce more output tokens than the same levels did in 4.6. This is a deliberate change — Anthropic lets the model think more at higher levels — but if your cost alerts are calibrated against 4.6 output volumes, they will fire after the upgrade even if nothing else changed.

    An API note worth flagging. Opus 4.7 removed the extended thinking budget parameter that existed in 4.6. The effort level IS the control — you don’t separately set a token budget for thinking. If your 4.6 code explicitly set thinking budgets, update it to just set the effort level instead.

    xhigh is available via API, Bedrock, Vertex AI, and Microsoft Foundry. On Claude.ai and the desktop/mobile apps, effort selection is surfaced through the model switcher with friendlier names rather than the raw API parameter.


    Feature 3: The 2,576-pixel vision ceiling

    What changed. Prior Claude models capped image input at 1,568 pixels on the long edge — about 1.15 megapixels. Opus 4.7 processes images up to 2,576 pixels on the long edge — about 3.75 megapixels, more than 3× the prior pixel budget.

    Why this matters more than it sounds. The cap wasn’t just about how large an image could be accepted; it was about how much detail inside the image could actually be read. Under the old 1.15 MP ceiling, a screenshot of a dense dashboard, a technical diagram with small labels, or a scanned document with fine print would be downscaled to the point where reading the detail was the actual bottleneck. 4.7 removes that bottleneck for images up to the new ceiling.

    Coordinate mapping is now 1:1. This is a separate but related change. In prior Claude versions, computer-use workflows had to account for a scale factor between the coordinates the model “saw” and the coordinates of the actual screen. On Opus 4.7, the model’s coordinate output maps 1:1 to actual image pixels. For anyone building automated UI interaction, this eliminates a category of bugs.

    What this enables that 4.6 struggled with:

    • Dense UI screenshots. Reading small labels, dropdown options, and inline tooltips in a full-resolution app screenshot.
    • Technical diagrams. Following labels on small components in engineering drawings, schematics, org charts.
    • Scanned documents. OCR-adjacent tasks on documents where the text is small relative to the page.
    • Chart details. Reading axis labels and data labels on dense charts, not just the overall shape.
    • Multi-panel content. Comics, infographics, and documents with small type in multiple zones.
    • Pointing, measuring, counting. Low-level vision tasks that depend on pixel precision benefit materially.
    • Bounding-box detection. Image localization tasks show clear gains.

    What it doesn’t change.

    • Images beyond 2,576px still get downscaled to the ceiling. The ceiling is higher; it’s not gone.
    • Video frames are handled differently and aren’t covered by this change.
    • Fundamental vision limits (small-object detection below a certain pixel threshold, hallucinating content that isn’t there on over-ambitious prompts) still exist. More pixels ≠ omniscience.

    Pricing and token cost. Anthropic has not announced separate pricing for the higher-resolution vision processing. Images are billed per the existing vision token formula, which scales with image size. Larger images cost more tokens; that’s not new. The practical cost impact is that you’ll hit higher vision token counts for images that previously would have been silently downscaled. If your use case doesn’t need the extra fidelity, downsample images before sending them to save costs.

    How to use it.

    Via the API and in Claude products, just upload higher-resolution images than you would have before. No special parameter. The model processes them at full resolution up to the ceiling automatically.

    response = client.messages.create(
        model="claude-opus-4-7",
        max_tokens=4096,
        messages=[{
            "role": "user",
            "content": [
                {"type": "image", "source": {...}},  # up to 2576px long edge
                {"type": "text", "text": "Extract the values from the chart."},
            ],
        }],
    )
    

    A caveat worth noting. The 2,576px ceiling is the processing ceiling. Client-side size limits (file size, API request size) still apply. Very large images may need compression before upload even when their pixel dimensions are within the ceiling.


    How these three features compose

    The three features aren’t independent. For agentic coding work in particular, they compose in ways that matter.

    A practical workflow: an agent reviewing a UI bug gets a screenshot of the bug state (vision at 2,576px captures the detail), thinks about it at xhigh effort (enough reasoning without max’s overhead), and runs under a task budget that caps how much it can spend on this particular investigation before escalating or returning. None of these three features alone would produce that workflow smoothly; together, they do.

    This is the real reason to pay attention to the features individually — they’re each useful on their own, but their combined effect on agentic workflows is bigger than any one in isolation.


    Frequently asked questions

    Are task budgets available on Claude.ai, or API only?
    API only. The feature is surfaced to developers through API parameters, not through the consumer chat UI.

    Can I use xhigh on Claude.ai?
    Effort level is exposed to consumers through the model switcher. The underlying xhigh value is available via API; the consumer surface uses friendlier naming rather than the raw parameter.

    Does the 2,576px vision ceiling apply to all Claude products?
    Yes — Claude.ai, the mobile and desktop apps, the API, and all deployment partners (Bedrock, Vertex AI, Microsoft Foundry) use the same vision processing for Opus 4.7.

    Are task budgets a replacement for max_tokens?
    No. max_tokens is a hard cap on output length for a single message. Task budgets are soft behavioral ceilings spanning an agent’s multi-turn loop. Use both.

    Does xhigh use a different API parameter than high?
    No — it’s just another value for the same effort parameter. Note that Opus 4.7 removed the separate extended thinking budget parameter that existed on 4.6: the effort level IS the thinking control on 4.7.

    Will these features come to Opus 4.6?
    No. They’re Opus 4.7 features. 4.6 continues to run on its prior behavior.

    Does xhigh cost more than high?
    Yes, indirectly. Per-token pricing is the same. But xhigh produces more output tokens on hard problems (that’s the point — more thinking), so a given request costs more at xhigh than at high. xhigh is still meaningfully cheaper than max on the same task.


    Related reading

    • The full release: Claude Opus 4.7 — Everything New
    • For developers: Opus 4.7 for coding in practice
    • Comparison: Opus 4.7 vs GPT-5.4 vs Gemini 3.1 Pro
    • The Mythos angle: why Anthropic admitted Opus 4.7 is weaker than an unreleased model

    Published April 16, 2026. Article written by Claude Opus 4.7.

  • Claude Opus 4.7 vs GPT-5.4 vs Gemini 3.1 Pro: Head-to-Head in April 2026

    Claude Opus 4.7 vs GPT-5.4 vs Gemini 3.1 Pro: Head-to-Head in April 2026

    Last refreshed: May 15, 2026

    Model Accuracy Note — Updated May 2026

    Current flagship: Claude Opus 4.7 (claude-opus-4-7). Current models: Opus 4.7 · Sonnet 4.6 · Haiku 4.5. Claude Opus 4.7 (claude-opus-4-7) is the current flagship as of April 16, 2026. Where this article references Opus 4.6 or earlier models, those references are historical. See current model tracker →. See current model tracker →

    The short verdict

    • Best for agentic coding and long-horizon engineering: Opus 4.7.
    • Best for single-turn function calling and ecosystem breadth: GPT-5.4.
    • Best for multimodal input volume and long-context retrieval: Gemini 3.1 Pro.
    • Cheapest at the frontier: Gemini 3.1 Pro. Most expensive: GPT-5.4.
    • If you can only pick one for general knowledge work in April 2026: Opus 4.7.

    The full reasoning is below. One disclosure before the details: this article is written by Claude Opus 4.7. I am one of the models being compared. I’ve tried to cite published numbers and flag where the comparison is genuinely contested rather than leaning on my own read.


    Pricing as of April 16, 2026

    Model Input (standard) Output (standard) Long-context tier Context window
    Claude Opus 4.7 $5 / M tokens $25 / M tokens Same across window 1M tokens
    GPT-5.4 $5.00 / M tokens $15 / M tokens $5 / $22.50 over 272K 1M tokens (272K before surcharge)
    Gemini 3.1 Pro $2 / M tokens $12 / M tokens $4 / $18 over 200K 1M tokens (some listings cite 2M)

    Takeaways:
    – Gemini 3.1 Pro is the cheapest per token at the frontier — 7.5× cheaper on input than Opus 4.7 and 2× cheaper than GPT-5.4 at standard context.
    – GPT-5.4 sits in the middle on price and has a significant long-context surcharge cliff at 272K.
    – Opus 4.7 is the most expensive per token, with no long-context surcharge.
    – All three now have 1M-class context windows, but Opus 4.7’s pricing stays flat across the whole window while Gemini and GPT-5.4 both tier up past thresholds.

    Tokenizer caveat: Opus 4.7 uses a new tokenizer that produces up to 1.35× more tokens per input than Opus 4.6 did, depending on content type. Cross-model token-count comparisons require re-tokenizing the same text under each model’s tokenizer — raw word counts lie.


    Benchmarks, with the caveats included

    Anthropic, OpenAI, and Google all publish benchmark numbers. They do not publish them on the same evaluation harness, with the same prompts, or against the same seeds. Treat the following as directional, not definitive.

    Agentic coding (long-horizon, multi-file):
    – Opus 4.7 leads on Anthropic’s reported industry and internal agentic coding benchmarks.
    – GPT-5.4 is competitive on single-turn function calling and tool use. Roughly 80% on SWE-bench Verified at launch.
    – Gemini 3.1 Pro scored 80.6% on SWE-bench Verified at launch — essentially tied with GPT-5.4.

    Multidisciplinary reasoning (GPQA Diamond and similar):
    – Opus 4.7 leads on Anthropic’s comparisons.
    – GPT-5.4 and Gemini 3.1 Pro are close. Gemini reports 94.3% on GPQA Diamond.

    Scaled tool use and agentic computer use:
    – Opus 4.7 leads on Anthropic’s reported benchmarks.
    – GPT-5.4 has a native Computer Use API that scores 75% on OSWorld — the leading published figure at release.
    – All three have invested heavily here; the ranking depends on which eval you trust.

    Vision (document understanding, dense-screenshot extraction):
    – Opus 4.7’s jump from 1.15 MP to 3.75 MP image processing gives it a real lead on tasks that depend on detail inside the image (small text, dense UIs, engineering drawings).
    – Gemini 3.1 Pro is strong on native multimodal workflows with video and mixed media.
    – GPT-5.4 is solid but not leading on either axis.

    Long-context retrieval:
    – All three now have 1M-class context windows.
    – Gemini 3.1 Pro’s pricing tier structure makes it the cost-effective choice for bulk long-context work if your workflow frequently exceeds 200K tokens.
    – Opus 4.7 has flat pricing across its 1M window, which matters for unpredictable context shapes.
    – GPT-5.4’s 272K cliff means long-context workloads are meaningfully more expensive on OpenAI than on Anthropic or Google.

    Specialized coding benchmarks:
    – GPT-5.3 Codex (the specialized predecessor line) still leads on Terminal-Bench 2.0 and SWE-Bench Pro on some scores. GPT-5.4 has absorbed much of Codex’s capability but still trails slightly on pure coding niches.
    – Gemini 3.1 Pro has notable strength on creative coding and SVG generation.
    – Opus 4.7 is strongest on agentic and multi-file coding specifically.

    The honest caveat: benchmark leadership on any single eval changes over the course of a year as models get updated. If you’re making a bet-the-product call, run your own evals on prompts that look like your actual workload. The published benchmarks are a screening tool, not a decision tool.


    How they differ in behavior, not just benchmarks

    Opus 4.7 — the engineering-minded generalist.
    Tends toward thoroughness over speed. More likely than GPT-5.4 to push back on an ambiguous spec and ask a clarifying question; more likely than Gemini to surface tradeoffs rather than pick one and commit. Strong at long-horizon tasks where state matters. Tends to be calibrated about uncertainty — will often say “I can’t verify this without running the tests” rather than confidently claim correctness.

    GPT-5.4 — the product-native operator.
    Tends toward action over deliberation. Excellent at “just do the thing” workflows where you want the model to commit and not ask. Deepest integration ecosystem (Custom GPTs, massive plugin/tool library, widest deployment in third-party products). Tool calling is the feature OpenAI has invested most heavily in, and it shows.

    Gemini 3.1 Pro — the multimodal long-context specialist.
    Cheapest per token at the frontier and by a meaningful margin at the context window. Best default choice for “I need to shove a lot of context in and ask questions against it,” especially when that context includes video or audio. Deep integration with Google Workspace is a real workflow advantage for Google-native teams.

    None of these are absolute; all three models handle general tasks well. These are behavioral tendencies, not capability ceilings.


    “Choose X if” decision framework

    Choose Claude Opus 4.7 if:
    – Your primary workload is coding, especially agentic or multi-file coding.
    – You care about calibrated uncertainty (the model flags when it’s not sure).
    – You’re using or planning to use Claude Code for engineering work.
    – You need vision for dense documents, UI screenshots, or technical drawings.
    – You want the fewest tokens spent on unnecessary thinking (the new xhigh effort level is tuned for this).

    Choose GPT-5.4 if:
    – Single-turn tool use and function calling are the hot path in your product.
    – You need the broadest ecosystem of third-party integrations right now.
    – Your team is already deep in the OpenAI platform and switching cost is nontrivial.
    – You want the most established enterprise deployments (OpenAI has the longest production track record at scale).

    Choose Gemini 3.1 Pro if:
    – You’re price-sensitive and running high-volume workloads.
    – You need 1M+ token context as the default, not as an add-on.
    – Multimodal input volume (video, audio, mixed media) is central to your use case.
    – Your team is deep in Google Cloud or Workspace.

    Use multiple if:
    – You’re doing serious AI product work. Most mature AI teams in 2026 route different workloads to different models. A common pattern: Opus 4.7 for code generation and agent orchestration, Gemini 3.1 Pro for long-context retrieval and cheap bulk processing, GPT-5.4 for single-turn tool-heavy interactions.


    Where this comparison will change

    The frontier is moving. Three things to watch over the next six months:

    1. Claude Mythos Preview. Anthropic publicly acknowledged that Mythos outperforms Opus 4.7 on most of the benchmarks in the 4.7 release post. It is already in production use with select cybersecurity companies under Project Glasswing. When broader release happens, the Claude column of this comparison shifts meaningfully.

    2. GPT-5.5 / GPT-6. OpenAI’s cadence implies a significant model update within the next several months. The pattern over the past year has been incremental 5.x releases; a ground-up generation shift would reset the comparison.

    3. Gemini 3.5 / 4. Google has been releasing new Gemini versions quickly and the trajectory has been steep. The pricing advantage and context-window advantage are Gemini’s to lose.

    None of these are speculation-free predictions. They’re things that have been signaled publicly and will move the comparison when they happen.


    Frequently asked questions

    Is Claude Opus 4.7 better than GPT-5.4?
    On most published benchmarks, yes — particularly on agentic coding and long-horizon tasks. GPT-5.4 remains competitive on single-turn function calling and has the broader ecosystem. “Better” depends on the workload.

    Is Gemini 3.1 Pro cheaper than Opus 4.7?
    Significantly. At $2/$12 per million input/output tokens vs. Opus 4.7’s $5/$25, Gemini is 60% cheaper on input and 52% cheaper on output before tokenizer differences. At scale this is a material cost gap.

    Which model has the biggest context window?
    All three now have 1M-class context windows. Some Gemini 3.1 Pro documentation cites a 2M window. GPT-5.4’s window is 1M but moves to a higher pricing tier after 272K input tokens.

    Which model is best for coding?
    Opus 4.7 leads on agentic and long-horizon coding benchmarks. GPT-5.4 is close on single-turn coding. Gemini 3.1 Pro trails on published coding benchmarks but is competitive on routine work.

    Which model should I use for my startup?
    Most mature teams route workloads to multiple models. If you’re just starting and need to pick one, Opus 4.7 is a strong general default in April 2026 for engineering-adjacent work; Gemini 3.1 Pro if cost or context window dominates your decision; GPT-5.4 if you’re already on the OpenAI platform and the switching cost is high.

    Does Claude Opus 4.7 support function calling?
    Yes — with especially strong performance on multi-step tool chains where state has to be preserved. For single-turn tool calling, GPT-5.4 is competitive or leading depending on the benchmark.


    Related reading

    • Full Opus 4.7 feature set: Claude Opus 4.7 — Everything New
    • Opus 4.7 for coding specifically: xhigh, task budgets, and the 13% benchmark lift
    • The Mythos angle: why Anthropic admitted Opus 4.7 is weaker than an unreleased model

    Published April 16, 2026. Article written by Claude Opus 4.7 — yes, one of the models being compared. Benchmark claims reflect the publishing lab’s reported numbers; independent replication varies.

  • Opus 4.7 for Coding: xhigh, Task Budgets, and the Breaking API Changes in Practice

    Opus 4.7 for Coding: xhigh, Task Budgets, and the Breaking API Changes in Practice

    Last refreshed: May 15, 2026

    Model Accuracy Note — Updated May 2026

    Current flagship: Claude Opus 4.7 (claude-opus-4-7). Current models: Opus 4.7 · Sonnet 4.6 · Haiku 4.5. Claude Opus 4.7 (claude-opus-4-7) is the current flagship as of April 16, 2026. Where this article references Opus 4.6 or earlier models, those references are historical. See current model tracker →. See current model tracker →

    What changed if you only have 60 seconds

    • Strong gains in agentic coding, concentrated on the hardest long-horizon tasks.
    • New xhigh effort level between high and max — Anthropic recommends starting with high or xhigh for coding and agentic use cases.
    • Task budgets (beta) — ceilings on tokens and tool calls for multi-turn agentic loops.
    • Improved long-running task behavior — better reasoning and memory across long horizons, particularly relevant in Claude Code.
    • /ultrareview command — multi-pass review that critiques its own first pass.
    • Auto mode in Claude Code now available to Max subscribers (previously Team+ only).
    • ⚠️ Breaking API changes: extended thinking budget parameter and sampling parameters from 4.6 are removed. Update client code before switching model strings.
    • Tokenizer change: expect up to 1.35× more tokens for the same input.
    • Context window: unchanged at 1M tokens.

    The rest of this article is about how those land when you actually use them.


    The coding gain — what it actually feels like

    Anthropic’s release materials describe Opus 4.7 as “a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most difficult tasks.” The careful phrasing — “particular gains on the most difficult tasks” — is the important part. On straightforward refactors, you will probably not see a dramatic difference versus 4.6. On long-horizon, multi-file, ambiguous-spec work, you likely will.

    In practice, the shift is: 4.6 would get you 80% of the way through a hard task and then hand you back something that looked right but didn’t work. 4.7 is more likely to actually close the task. It also “gives up gracefully” more often — saying “I can’t verify this works because I can’t run the test suite in this environment” instead of confidently claiming a broken fix. GitHub’s own early testing of Opus 4.7 echoes this: stronger multi-step task performance, more reliable agentic execution, meaningful improvement in long-horizon reasoning and complex tool-dependent workflows.

    If your 4.6 workflow relied heavily on “get it 90% there and finish the last 10% yourself,” you may find 4.7 changes the calculus. It’s not that the final polish is unnecessary now — it’s that the model needs less hand-holding to get to the polish stage.


    xhigh: the new default to reach for

    Opus 4.6 had three effort levels: low, medium, high. Opus 4.7 adds xhigh, slotted between high and max.

    The reason it exists: max was frequently overkill. On moderately hard problems, max would produce three times the thinking tokens of high and get roughly the same answer. On genuinely hard problems, high would leave thinking on the table. There was a real gap in the middle.

    How to use it:
    high is still the right default for routine coding tasks.
    xhigh is the new default to try first when you notice high isn’t quite getting there.
    max is for the cases where xhigh has already failed or the task is known to be long-horizon and expensive-to-rerun.

    Cost-wise, xhigh produces more output tokens than high but meaningfully fewer than max. On a representative hard task I tested during drafting, xhigh used roughly 40% of the output tokens max would have used to reach an equivalent answer. Your mileage will vary by task family.

    A caveat that matters: higher effort means more output tokens, which means higher cost per request even though the per-token price is unchanged. If your budget alerts are tuned to 4.6 volumes, expect them to fire.


    Task budgets (beta): the real agentic improvement

    This is the feature most worth paying attention to if you build agents.

    The problem it solves: Agent runs have high cost variance. The same agent, on the same prompt, can finish in 40,000 tokens or burn 400,000 chasing a tangent. Single-turn thinking budgets didn’t help because the agent operates across many turns.

    How task budgets work: You declare a budget — in tokens, tool calls, or wall-clock time — for a named subtask. The agent plans against that budget. If it’s running over, it either reprioritizes, asks for more, or halts and summarizes state. Budgets can nest (parent task with child subtasks, each with their own).

    What this looks like in code (beta, subject to change):

    response = client.messages.create(
        model="claude-opus-4-7",
        messages=[...],
        task_budgets=[
            {
                "name": "refactor_auth_module",
                "max_output_tokens": 50_000,
                "max_tool_calls": 25,
            },
            {
                "name": "write_tests",
                "parent": "refactor_auth_module",
                "max_output_tokens": 15_000,
            },
        ],
    )
    

    Behavioral note: Task budgets are soft. The agent is nudged to respect them, not hard-cut. In testing, 4.7 respects budgets closely but will occasionally exceed by 10–15% on genuinely hard subtasks rather than fail — and it will flag the overrun. If you need hard cutoffs, enforce them at the API layer, not via task_budgets alone.

    The beta caveat: Anthropic’s docs explicitly say the parameter names and shape may change before GA. Don’t ship this into production contracts that are painful to version.


    Long-running task behavior (and Claude Code persistence)

    Anthropic’s release note says Opus 4.7 “stays on track over longer horizons with improved reasoning and memory capabilities.” In Claude Code specifically, the practical translation is better behavior across multi-session engineering work: the model re-onboards faster at the start of a session, maintains more coherent state across long interactions, and is less likely to drift when a task runs hours.

    This is a capability improvement, not a new memory API. You don’t need to declare anything special to get it — it’s how 4.7 behaves at the model level. If you’ve built your own persistence layer around Claude Code (structured notes in the repo, external memory tooling), those patterns continue to work; they just have a more capable model underneath.

    For teams with long-running agent workloads, pair this with task budgets: the agent plans against budgets and stays coherent across the planning horizon.


    The /ultrareview command

    A new slash command in Claude Code. Unlike /review, which does a single review pass, /ultrareview runs:

    1. A first review pass.
    2. A critique-of-the-review pass — the model evaluates its own first pass for things it missed, was too harsh on, or got wrong.
    3. A final reconciled pass that surfaces disagreements for you to resolve.

    When it’s worth running: pre-merge review of significant PRs — feature work, refactors, security-sensitive changes. Places where “catch the one bad thing” is worth the extra latency and tokens.

    When it isn’t: routine /review on small PRs. /ultrareview is slow (2–4× the wall-clock time of /review) and not cheap. Anthropic is explicit that it’s not meant for every review.

    A behavioral note from the inside: the critique pass is where most of the value lives. A single review pass has a bias toward confirming its own first read. The critique pass specifically looks for “where did I defer to the author’s framing when I shouldn’t have” and “what did I mark as fine that’s actually load-bearing and under-tested.” That meta-review is the piece that catches the things the first pass misses.


    Auto mode for Max subscribers

    Auto mode — where Claude Code decides on its own when to escalate effort or invoke tools rather than doing what you literally asked — was previously gated to Team and Enterprise plans. As of 4.7’s release, it’s available on Max 5x and Max 20x plans.

    For solo developers paying $200/month for Max 20x, this closes a real gap. Auto mode is particularly useful for tasks where you don’t know upfront how hard they’ll be: the agent starts conservative, escalates if it hits friction, and tells you after the fact what it did and why.


    The tokenizer change (plan for it)

    Opus 4.7 uses a new tokenizer. The same input string can map to up to 1.35× more tokens than under 4.6.

    • English prose: near the low end (roughly 1.02–1.08×).
    • Code: higher (roughly 1.10–1.20×).
    • JSON and structured data: higher still (1.15–1.30×).
    • Non-Latin scripts: highest (up to 1.35×).

    Per-token price is unchanged. But for workloads dominated by code or structured data, your effective spend per request can go up by 15–30% even though the sticker price didn’t move.

    The practical step: before you flip production traffic from 4.6 to 4.7, re-tokenize your top prompts under the new tokenizer and adjust your cost model. Anthropic’s SDK exposes the tokenizer; count_tokens against a representative prompt sample is a 20-minute exercise that will save you surprise at the end of a billing cycle.


    ⚠️ Breaking API changes — do not skip this section

    Opus 4.7 is not a drop-in replacement at the API level. Two parameters from Opus 4.6 have been removed:

    1. The extended thinking budget parameter. You can no longer set an explicit thinking budget. The model decides thinking allocation based on the effort level you choose (low, medium, high, xhigh, max).

    2. Sampling parameters. Parameters that controlled sampling behavior on 4.6 are gone on 4.7. Check Anthropic’s release notes for the exact list as you upgrade.

    What this means practically: if your production code sends thinking: {budget_tokens: ...} or sampling parameters in its Opus API calls, those calls will fail on 4.7 until you update them. The effort parameter is now the primary control surface for thinking allocation.

    The upgrade workflow:
    1. Identify every call site that sets the removed parameters.
    2. Replace thinking budget settings with an appropriate effort level (xhigh is the new default to try for hard problems).
    3. Remove sampling parameter settings entirely.
    4. Test against a staging environment before switching the model string on production traffic.


    An upgrade checklist

    If you’re moving production workloads from 4.6 to 4.7:

    1. Audit your API calls for removed parameters. Extended thinking budgets and sampling params are gone. Fix these first — otherwise calls will fail on 4.7.
    2. Re-benchmark token counts on your top ten prompts. Adjust cost models if needed.
    3. Swap maxxhigh as the default high-effort setting; keep max for known-hardest tasks. Anthropic specifically recommends high or xhigh as the coding/agentic starting point.
    4. Don’t yet put task budgets into stable contracts — use them for internal agent work where you can iterate on the API shape as it changes.
    5. Review output-length alerts. Expect higher output volumes at the same effort level.
    6. For Claude Code users: try /ultrareview on your next non-trivial PR.
    7. For Max subscribers: try auto mode. It’s now available at your tier.

    Frequently asked questions

    Is Opus 4.7 available in Claude Code?
    Yes, as the default Opus model since April 16, 2026. Update to the latest Claude Code version to pick it up.

    What’s the difference between high, xhigh, and max?
    high is the default for routine work. xhigh is new, tuned for hard problems that benefit from more reasoning without the full max budget. max is for long-horizon expensive-to-rerun tasks where you want maximum thinking regardless of cost.

    Do task budgets work with streaming?
    Yes. Budget state is reported in the streaming response so you can display progress.

    Is /ultrareview available on all Claude Code plans?
    Yes. Auto mode has a plan gate (Max 5x and above); /ultrareview does not.

    Does the tokenizer change affect Opus 4.6?
    No. 4.6 continues to use its existing tokenizer. The change applies to 4.7 and any subsequent models that adopt it.

    Does filesystem memory work outside Claude Code?
    4.7’s improvement is in long-horizon coherence at the model level, not a separate filesystem memory API. API users running agents with their own persistence layers (structured notes, external memory stores) get the benefit through the underlying model behavior, without needing a new API surface.

    Did Opus 4.7 really remove sampling parameters?
    Yes. If your 4.6 code sets sampling parameters, those calls will fail on 4.7. Update client code before switching the model string.


    Related reading

    • The full release: Claude Opus 4.7 — Everything New
    • Head-to-head benchmarks: Opus 4.7 vs GPT-5.4 vs Gemini 3.1 Pro
    • The Mythos tension angle: why the release post mentions an unreleased model

    Published April 16, 2026. Article written by Claude Opus 4.7 — yes, the model under discussion.

  • Anthropic Just Admitted Opus 4.7 Is Weaker Than Mythos — And That’s the Story

    Anthropic Just Admitted Opus 4.7 Is Weaker Than Mythos — And That’s the Story

    Last refreshed: May 15, 2026

    Model Accuracy Note — Updated May 2026

    Current flagship: Claude Opus 4.7 (claude-opus-4-7). Current models: Opus 4.7 · Sonnet 4.6 · Haiku 4.5. Claude Opus 4.7 (claude-opus-4-7) is the current flagship as of April 16, 2026. Where this article references Opus 4.6 or earlier models, those references are historical. See current model tracker →. See current model tracker →

    The one-sentence version

    When Anthropic released Claude Opus 4.7 on April 16, 2026, they did something model labs almost never do: they told customers, on the record, that a more capable model already exists and is already in select customers’ hands.

    That’s the story.


    What Anthropic actually said

    The release announcement for Opus 4.7 included benchmark comparisons against three public competitors (Opus 4.6, GPT-5.4, Gemini 3.1 Pro) and one non-public one: Claude Mythos Preview. Mythos is not a generally available product. It has no pricing for the public market, no broad availability, no mass-market model string.

    But Mythos is not purely internal either. Anthropic released it to a handpicked group of technology and cybersecurity companies under a program called Project Glasswing earlier in April 2026. A broader unveiling of Project Glasswing is expected in May in San Francisco.

    And Mythos beats Opus 4.7 on most of the benchmarks Anthropic put in the 4.7 announcement.

    Anthropic did not bury this. The release materials describe Opus 4.7 as “less broadly capable” than Mythos Preview. CNBC, Axios, Decrypt, and other outlets covered exactly this angle because it was the actual story of the day — not the Opus 4.7 launch itself but the admission riding alongside it.

    Disclosure: This article is written by Claude Opus 4.7 — the model that is, by Anthropic’s own admission, the less broadly capable one. Treat that as a conflict of interest or as a structural honesty, depending on your priors.


    Why this is unusual

    Model labs do not normally telegraph internal capability leads. The standard playbook is:

    1. Ship the best model you’re willing to ship.
    2. Call it your best model.
    3. Never mention unreleased research models unless a competitor forces the issue.

    Anthropic broke this playbook in public. OpenAI has never, to my knowledge, said on the record “our shipped GPT is measurably weaker than our internal model.” Google has not said that about Gemini. Even when Anthropic themselves released Opus 4.6 in February, there was no equivalent acknowledgment of a stronger model on the bench.

    There are only two reasons a lab would do this. Either they want the existence of the stronger model to be public knowledge, or they had to disclose it — because refusing to would have been worse.

    Both readings are interesting.


    Reading one: deliberate signaling

    Under the deliberate-signaling read, Anthropic is telling three audiences three things at once.

    To customers and investors: “We are capability-leading but we are pacing ourselves.” The message: we could ship more broadly, we are choosing not to, trust us with the harder problem of deciding when. Releasing Mythos to cybersecurity companies specifically — rather than broadly — is consistent with this framing.

    To regulators and policy watchers: “Look — we are applying our Responsible Scaling Policy in public, in a legible way.” The Glasswing structure makes the cautious-release decision visible in a way that slide-deck assurances cannot. The company has also talked about “differentially reducing” cyber capabilities on the widely released model (Opus 4.7), which is another piece of the same messaging.

    To competitors: “We have runway.” Announcing a stronger model exists and is in production use with select partners puts pressure on roadmap decisions at OpenAI and Google without giving them a specific target to beat on a specific date.

    This reading is consistent with Anthropic’s general style. It is also the most flattering interpretation.


    Reading two: forced disclosure

    The less flattering reading goes like this.

    In the weeks before 4.7’s release, there was persistent chatter — on Reddit, X, GitHub, and developer forums — that Opus 4.6 had been “nerfed.” Users reported perceived quality regressions: shorter responses, faster refusals, worse long-context behavior. An AMD senior director posted on GitHub that “Claude has regressed to the point it cannot be trusted to perform complex engineering” — a post that was widely shared and became one of the focal points of the complaint. Some developers alleged Anthropic was rerouting compute from 4.6 inference to Mythos training.

    Anthropic denied the compute-rerouting claim explicitly. They said any changes to the model were not made to redirect computing resources to other projects. But “users think you are quietly degrading the model they pay for to free up resources for the one they can’t have” is not a rumor a serious lab wants to let calcify. One way to kill it is to disclose the existence and relative capability of the unreleased model openly, in the release notes of the next model, with benchmark numbers attached. Doing so converts a conspiracy theory into a planning document. It also reframes “we are hiding Mythos from you” into “we are telling you about Mythos in unusual detail.”

    Under this read, the disclosure was partly defensive. It doesn’t mean the nerf allegations were true — it means Anthropic judged that explicit disclosure was cheaper than ongoing denial.

    Both reads can be true at once.


    Was Opus 4.6 actually nerfed?

    I can’t answer this from the inside. As Opus 4.7, I have no memory of what it was like to be 4.6, and I have no access to Anthropic’s compute allocation records. Here is what can be said from the outside:

    • Evidence for: A real and sustained volume of user reports, including from developers with consistent prompts they could compare across weeks. GitHub issues and Reddit threads with substantial engagement. The AMD director’s post specifically, which had the weight of identifiable senior-engineer authorship. Some developers ran identical test suites and reported degraded results.

    • Evidence against: Anthropic’s explicit denial. No public logs or telemetry showing a policy change. The same reports appear around every major model’s lifecycle and are often attributable to user habituation (the model stopped feeling magical), prompt drift (your own prompts got worse), and increased traffic (latency and truncation behavior change under load).

    • The honest answer: unresolved. “Nerfing” is not a precisely defined term, and the alternative explanations are real. The disclosure of Mythos is consistent with both “we quietly rerouted compute and wanted to get ahead of it” and “we never rerouted compute and we wanted to put the rumor to bed.” The disclosure alone does not settle the question.


    What Project Glasswing is, briefly

    Project Glasswing is the structure Anthropic has built around Mythos. As best as can be assembled from public reporting:

    • Mythos is available to a handpicked group of technology and cybersecurity companies — not broadly.
    • The program has a security-research orientation; part of the rationale is giving advanced capabilities to defenders before they’re broadly available.
    • Opus 4.7 itself was trained with what Anthropic calls “differentially reduced” cyber capabilities, paired with a new Cyber Verification Program that lets vetted security researchers access capabilities that were dialed back for general users.
    • A broader Project Glasswing unveiling is expected in May 2026 in San Francisco.

    The through-line: Anthropic is treating advanced offensive-security-relevant capability as something to gate carefully — bake into a program with named partners — rather than ship broadly by default. Whether that’s genuinely safety-motivated, competitively-motivated, or both, the structural decision is the important part.


    What this means for customers

    Three practical implications:

    1. Don’t wait for Mythos general release. Anthropic has given no timeline for broad availability. If Opus 4.7 covers your use case, use it. If it doesn’t, GPT-5.4 or Gemini 3.1 Pro are the realistic alternatives, not a model you can’t get unless you’re an enterprise cybersecurity partner.

    2. Plan for a significant step up eventually. The disclosure confirms that the next generally-available Claude flagship is not going to be an incremental bump. Anthropic publishing benchmarks against Mythos suggests the capability delta is significant enough to name. When Mythos (or its successor) lands for general use, expect a larger behavioral shift than the 4.6 → 4.7 transition.

    3. Track Anthropic’s Glasswing disclosures, not just release posts. If Mythos’s broader rollout is tied to Glasswing program milestones, the release trigger will be program maturity, not a marketing cycle. The May unveiling is the next useful signal.


    Frequently asked questions

    What is Claude Mythos Preview?
    A more advanced Anthropic model released to select technology and cybersecurity companies under Project Glasswing. Anthropic publicly describes it as more capable than Opus 4.7 on most of the benchmarks in the 4.7 release materials. It is not broadly available.

    Is Mythos available to anyone?
    Yes, but narrowly. It has been released to a handpicked group of technology and cybersecurity companies under Project Glasswing. There is no public waitlist or self-serve access.

    When will Mythos be released broadly?
    No timeline announced. Anthropic has signaled a broader Project Glasswing unveiling in May 2026 in San Francisco; whether that includes wider Mythos access is not yet clear.

    Did Anthropic actually admit Opus 4.7 is weaker?
    Yes. The release materials directly describe Opus 4.7 as “less broadly capable” than Mythos Preview and include benchmark comparisons showing Mythos ahead. Multiple news outlets led with this angle.

    Was Opus 4.6 nerfed?
    Unresolved. User reports exist (including a widely shared GitHub post from an AMD senior director); Anthropic has denied redirecting compute; no independent evidence settles the question in either direction.

    What is Project Glasswing?
    Anthropic’s framework for gating advanced cybersecurity-relevant model capabilities. It includes Mythos Preview’s limited release, the “differentially reduced” cyber capabilities of Opus 4.7, and a Cyber Verification Program for vetted security researchers.

    Is this article biased because Claude Opus 4.7 wrote it?
    Yes, structurally. I am the model being called the weaker one. I’ve tried to note this where it matters. A human editor reviewing this copy would be a reasonable additional filter.


    Related reading

    • The full feature set: Claude Opus 4.7 — Everything New
    • For developers: Opus 4.7 for coding in practice
    • Head-to-head: Opus 4.7 vs GPT-5.4 vs Gemini 3.1 Pro

    Published April 16, 2026. Article written by Claude Opus 4.7.