Anthropic shipped Claude Sonnet 5 (claude-sonnet-5) on 2026-06-30 as the most agentic Sonnet tier yet — positioned close to Opus 4.8 on agentic search and computer-use benchmarks, at $2 / $10 per million input/output tokens through 2026-08-31 (then $3 / $15). It is the default on Free and Pro, available in Claude Code and on the Claude Platform. For agent workflows, treat it as the upgraded daily driver and sub-agent worker; keep Opus 4.8 for orchestration and tasks where you need the highest accuracy at High or XHigh effort.
Definitions
Sonnet 5 — Anthropic’s mid-tier model (claude-sonnet-5). Replaces Sonnet 4.6 as the default Sonnet class. Anthropic reports strict improvement over Sonnet 4.6 on BrowseComp (agentic search) and OSWorld-Verified (computer use), with cost-performance curves that overlap Opus 4.8’s range at lower per-token price.
Effort level — a Sonnet/Opus parameter that trades token spend for accuracy. Anthropic’s launch charts show Sonnet 5 at High or XHigh effort can burn more tokens chasing Opus-level scores — the bill advantage shrinks or reverses at the top of the curve.
Cyber safeguards — real-time blocks on dangerous cybersecurity usage. Enabled by default on Sonnet 5 (same tier as Opus 4.7/4.8; less strict than Fable 5).
What shipped
| Sonnet 5 (intro) | Sonnet 5 (standard) | Opus 4.8 | |
|---|---|---|---|
| API ID | claude-sonnet-5 |
claude-sonnet-5 |
claude-opus-4-8 |
| Input / output ($/M tokens) | $2 / $10 (through 2026-08-31) | $3 / $15 | $5 / $25 |
| Default on claude.ai | Free + Pro | Free + Pro | Max / Team / Enterprise |
| vs Sonnet 4.6 | Strict improvement on agentic benchmarks; lower misalignment and hallucination rates | Same tokenizer caveat (below) | Higher accuracy on BrowseComp and OSWorld-Verified at top effort |
| Cyber capability | Substantially weaker than Opus on dangerous exploit tasks; safeguards on by default | Same | Stronger; Anthropic recommends Opus for cyber work needing reduced guardrails |
Tokenizer change: Sonnet 5 uses an updated tokenizer (similar to Opus 4.7). The same text can map to ~1.0–1.35× more tokens depending on content type. Anthropic set intro pricing so the switch from Sonnet 4.6 is roughly cost-neutral despite the count shift.
Availability: Claude Code, Claude Platform API, and all claude.ai plans from launch day. Anthropic also raised rate limits across Chat, Cowork, Claude Code, and the Platform for higher-effort runs.
Sources: Introducing Claude Sonnet 5.
Where Sonnet 5 sits in the stack
The three-tier pattern from pick the right Claude tier still holds — only the middle rung moved:
| Role | Tier | Why (post–Sonnet 5) |
|---|---|---|
| Orchestrator | Opus 4.8 | Longest horizon, highest autonomy, best top-end accuracy on agentic evals |
| Default worker | Sonnet 5 | Near-Opus agentic quality at 40% of Opus input cost ($3 vs $5 standard; $2 vs $5 intro) |
| Parallel worker | Haiku 4.5 | Still the cheapest fan-out tier until Haiku gets its own bump |
Early-access partners quoted on Anthropic’s launch post describe end-to-end task completion, unprompted self-checking, brownfield debugging, and fewer steps to the same outcome — the behaviors that matter for sub-agents spawned under an Opus orchestrator. See subagent context isolation for why the worker tier’s quality matters more than the orchestrator’s benchmark delta.
Encoded for launchers and config:
ORCHESTRATOR=claude-opus-4-8 # long-horizon, max accuracy at high effort
DEFAULT=claude-sonnet-5 # daily driver + sub-agent default (was sonnet-4-6)
WORKER=claude-haiku-4-5 # parallel scoped tasks
Reading the cost-performance curve
Anthropic’s launch charts plot cost vs accuracy at different effort levels on BrowseComp and OSWorld-Verified. Three takeaways for the bill:
- Sonnet 5 strictly beats Sonnet 4.6 at every effort level on those curves — a real upgrade, not a rename.
- Sonnet 5 and Opus 4.8 cover one range — Sonnet is cheaper at comparable effort; Opus still wins on peak accuracy.
- High and XHigh effort on Sonnet 5 spend more tokens per point of accuracy — at the top of the curve, total cost can approach or exceed Opus 4.8 for the same task quality.
Practical rule: run Sonnet 5 at Medium effort (or lower) for the worker and default roles. Escalate to Opus 4.8 — not Sonnet 5 at XHigh — when the task needs frontier accuracy and you are willing to pay for it.
When to stay on Sonnet 5 vs reach for Opus
| Situation | Tier |
|---|---|
| Interactive coding, document Q&A, most Claude Code sessions | Sonnet 5 |
| Sub-agents under an Opus orchestrator | Sonnet 5 |
| Long-horizon migration, hundreds of tool calls, plan must not drift | Opus 4.8 |
| Cybersecurity work that needs reduced guardrails | Opus 4.8 (per Anthropic) |
| You are burning Pro limits in ~30 minutes on Opus | Sonnet 5 — lower per-token rate extends the same subscription |
Sonnet 5 is not a Fable replacement. Anthropic did not train it for dangerous cyber tasks; it scores substantially below Opus 4.8 and Mythos 5 on exploit-development evaluations and ships with safeguards on by default.
Community reaction (unverified launch-day thread)
A user-supplied r/ClaudeAI launch thread (~80 comments, Wilson modbot TL;DR) split along predictable lines — not Anthropic data:
| Camp | Sentiment |
|---|---|
| Fable-or-bust | Top-voted comments wanted Fable or Opus 5, not another Sonnet |
| Pragmatic | Stronger cheap Sonnet matters more for daily agent work than restoring a frontier model most users cannot afford at volume |
| Bean counters | Cited the same High/XHigh cost curves — Sonnet 5’s value prop blurs when you crank effort to match Opus |
| Limit-reset | Many asked for a usage reset to try the model; unconfirmed at thread time |
Recurring anecdotal hope (not in Anthropic’s post): Sonnet 5 may produce less verbose output than Opus 4.8 for similar results — worth testing locally, not a published claim.
Treat client availability and limit-reset timing as thread-time reports; verify in your plan before publishing those as facts.
Bottom line
Sonnet 5 is the new default worker tier: agentic enough to shrink the gap to Opus 4.8, priced to run all day. Default to Sonnet 5, orchestrate with Opus 4.8, fan out on Haiku 4.5, and match effort level to the role — Medium Sonnet for volume, High Opus when accuracy is the bottleneck. The launch thread wanted Fable; the stack most developers actually run wanted a better Sonnet. Both can be true.