Claude Sonnet 5 closes the gap to Opus without the Opus bill

Anthropic shipped Claude Sonnet 5 (claude-sonnet-5) on 2026-06-30 as the most agentic Sonnet tier yet — positioned close to Opus 4.8 on agentic search and computer-use benchmarks, at $2 / $10 per million input/output tokens through 2026-08-31 (then $3 / $15). It is the default on Free and Pro, available in Claude Code and on the Claude Platform. For agent workflows, treat it as the upgraded daily driver and sub-agent worker; keep Opus 4.8 for orchestration and tasks where you need the highest accuracy at High or XHigh effort.

Definitions

Sonnet 5 — Anthropic’s mid-tier model (claude-sonnet-5). Replaces Sonnet 4.6 as the default Sonnet class. Anthropic reports strict improvement over Sonnet 4.6 on BrowseComp (agentic search) and OSWorld-Verified (computer use), with cost-performance curves that overlap Opus 4.8’s range at lower per-token price.

Effort level — a Sonnet/Opus parameter that trades token spend for accuracy. Anthropic’s launch charts show Sonnet 5 at High or XHigh effort can burn more tokens chasing Opus-level scores — the bill advantage shrinks or reverses at the top of the curve.

Cyber safeguards — real-time blocks on dangerous cybersecurity usage. Enabled by default on Sonnet 5 (same tier as Opus 4.7/4.8; less strict than Fable 5).

What shipped

	Sonnet 5 (intro)	Sonnet 5 (standard)	Opus 4.8
API ID	`claude-sonnet-5`	`claude-sonnet-5`	`claude-opus-4-8`
Input / output ($/M tokens)	$2 / $10 (through 2026-08-31)	$3 / $15	$5 / $25
Default on claude.ai	Free + Pro	Free + Pro	Max / Team / Enterprise
vs Sonnet 4.6	Strict improvement on agentic benchmarks; lower misalignment and hallucination rates	Same tokenizer caveat (below)	Higher accuracy on BrowseComp and OSWorld-Verified at top effort
Cyber capability	Substantially weaker than Opus on dangerous exploit tasks; safeguards on by default	Same	Stronger; Anthropic recommends Opus for cyber work needing reduced guardrails

Tokenizer change: Sonnet 5 uses an updated tokenizer (similar to Opus 4.7). The same text can map to ~1.0–1.35× more tokens depending on content type. Anthropic set intro pricing so the switch from Sonnet 4.6 is roughly cost-neutral despite the count shift.

Availability: Claude Code, Claude Platform API, and all claude.ai plans from launch day. Anthropic also raised rate limits across Chat, Cowork, Claude Code, and the Platform for higher-effort runs.

Sources: Introducing Claude Sonnet 5.

Where Sonnet 5 sits in the stack

The three-tier pattern from pick the right Claude tier still holds — only the middle rung moved:

Role	Tier	Why (post–Sonnet 5)
Orchestrator	Opus 4.8	Longest horizon, highest autonomy, best top-end accuracy on agentic evals
Default worker	Sonnet 5	Near-Opus agentic quality at 40% of Opus input cost ($3 vs $5 standard; $2 vs $5 intro)
Parallel worker	Haiku 4.5	Still the cheapest fan-out tier until Haiku gets its own bump

Early-access partners quoted on Anthropic’s launch post describe end-to-end task completion, unprompted self-checking, brownfield debugging, and fewer steps to the same outcome — the behaviors that matter for sub-agents spawned under an Opus orchestrator. See subagent context isolation for why the worker tier’s quality matters more than the orchestrator’s benchmark delta.

Encoded for launchers and config:

ORCHESTRATOR=claude-opus-4-8   # long-horizon, max accuracy at high effort
DEFAULT=claude-sonnet-5        # daily driver + sub-agent default (was sonnet-4-6)
WORKER=claude-haiku-4-5        # parallel scoped tasks

Reading the cost-performance curve

Anthropic’s launch charts plot cost vs accuracy at different effort levels on BrowseComp and OSWorld-Verified. Three takeaways for the bill:

Sonnet 5 strictly beats Sonnet 4.6 at every effort level on those curves — a real upgrade, not a rename.
Sonnet 5 and Opus 4.8 cover one range — Sonnet is cheaper at comparable effort; Opus still wins on peak accuracy.
High and XHigh effort on Sonnet 5 spend more tokens per point of accuracy — at the top of the curve, total cost can approach or exceed Opus 4.8 for the same task quality.

Practical rule: run Sonnet 5 at Medium effort (or lower) for the worker and default roles. Escalate to Opus 4.8 — not Sonnet 5 at XHigh — when the task needs frontier accuracy and you are willing to pay for it.

When to stay on Sonnet 5 vs reach for Opus

Situation	Tier
Interactive coding, document Q&A, most Claude Code sessions	Sonnet 5
Sub-agents under an Opus orchestrator	Sonnet 5
Long-horizon migration, hundreds of tool calls, plan must not drift	Opus 4.8
Cybersecurity work that needs reduced guardrails	Opus 4.8 (per Anthropic)
You are burning Pro limits in ~30 minutes on Opus	Sonnet 5 — lower per-token rate extends the same subscription

Sonnet 5 is not a Fable replacement. Anthropic did not train it for dangerous cyber tasks; it scores substantially below Opus 4.8 and Mythos 5 on exploit-development evaluations and ships with safeguards on by default.

Community reaction (unverified launch-day thread)

A user-supplied r/ClaudeAI launch thread (~80 comments, Wilson modbot TL;DR) split along predictable lines — not Anthropic data:

Camp	Sentiment
Fable-or-bust	Top-voted comments wanted Fable or Opus 5, not another Sonnet
Pragmatic	Stronger cheap Sonnet matters more for daily agent work than restoring a frontier model most users cannot afford at volume
Bean counters	Cited the same High/XHigh cost curves — Sonnet 5’s value prop blurs when you crank effort to match Opus
Limit-reset	Many asked for a usage reset to try the model; unconfirmed at thread time

Recurring anecdotal hope (not in Anthropic’s post): Sonnet 5 may produce less verbose output than Opus 4.8 for similar results — worth testing locally, not a published claim.

Treat client availability and limit-reset timing as thread-time reports; verify in your plan before publishing those as facts.

Bottom line

Sonnet 5 is the new default worker tier: agentic enough to shrink the gap to Opus 4.8, priced to run all day. Default to Sonnet 5, orchestrate with Opus 4.8, fan out on Haiku 4.5, and match effort level to the role — Medium Sonnet for volume, High Opus when accuracy is the bottleneck. The launch thread wanted Fable; the stack most developers actually run wanted a better Sonnet. Both can be true.