Introducing Claude Sonnet 5

Anthropic launched Claude Sonnet 5 on June 30, 2026, and immediately made it the default model for every Free and Pro user on claude.ai. The release was announced on the same day it went live, with access available immediately across all subscription tiers, including Free, Pro, Max, Team, and Enterprise, as well as Claude Code, the Claude API, Amazon Bedrock, Google Cloud, and Microsoft Foundry. Cursor also confirmed same-day availability, posting benchmark data showing a meaningful step up from Sonnet 4.6.
The headline claim from Anthropic is precise: Claude Sonnet 5 is the most agentic Sonnet model ever released. It can make multi-step plans, use tools like browsers and terminals, and run autonomously through complex workflows at a level that, just a few months ago, required Opus-class models costing significantly more to operate. For developers, enterprises, and AI practitioners building production agents and automation pipelines, this release represents one of the most practically significant cost-performance improvements in Anthropic's history.

For professionals who want to develop deep, expert-level understanding of Claude's architecture, agentic capabilities, and how to deploy Claude models effectively in production, a structured Claude AI Expert certification provides the rigorous, applied foundation needed to move from casual user to genuine practitioner of Claude-based systems like Sonnet 5.
This guide covers everything: what Claude Sonnet 5 is, how it benchmarks, what it costs, where it is available, how it compares to Opus 4.8, what changed in safety, and how to start using it today.
What Is Claude Sonnet 5?
Claude Sonnet 5 is Anthropic's latest mid-tier AI model and the newest release in the Sonnet series, which also includes Sonnet 3.5, 3.6, 3.7, and 4.6. It replaces Sonnet 4.6, which launched in February 2026, as the default model on Anthropic's consumer and API platforms.
Sonnet models occupy the middle tier of Anthropic's lineup: faster and more affordable than Opus-class models, while offering substantially more capability than Haiku-class models. Historically, the Sonnet series was the first to demonstrate impressive agentic capability: Sonnet 3.5 and 3.7 set early industry standards for autonomous coding and tool use. More recently, however, the most meaningful agentic advances had shifted to Opus-class models. Sonnet 5 is designed to bring those gains back down to mid-tier pricing.
Sonnet 5 achieves near-Opus 4.8 performance on agentic tasks, including reasoning, coding, knowledge work, and multi-step tool use, while remaining cheaper to operate. Anthropic is explicit about the positioning: Opus 4.8 remains the choice for the highest-accuracy requirements, but Sonnet 5 now covers the vast majority of agentic use cases at a fraction of the cost.
Claude Sonnet 5 Benchmarks: How It Performs
Anthropic published benchmark comparisons across multiple evaluations at launch, comparing Sonnet 5 directly against its predecessor Sonnet 4.6 and the current flagship Opus 4.8.
Agentic Coding
On the primary agentic coding benchmark:
Sonnet 5: 63.2%
Opus 4.8: 69.2%
Sonnet 4.6: 58.1%
Sonnet 5 is not yet at Opus 4.8 levels on coding, but the 5.1-point improvement over Sonnet 4.6 is substantial. More importantly, at Extra High reasoning effort, Sonnet 5's coding performance approaches Opus 4.8 at medium-to-high settings, narrowing the gap significantly when developers invest additional compute in each call.
Knowledge Work
On knowledge work benchmarks, Sonnet 5 slightly outperforms Opus 4.8, marking the first time a Sonnet model has exceeded Opus on this category. This makes Sonnet 5 directly applicable for research synthesis, document analysis, and complex information tasks without needing to route to the more expensive Opus tier.
OSWorld-Verified and BrowseComp
On OSWorld-Verified and agentic search benchmarks, Sonnet 5 at Extra High reasoning effort performs in line with Opus 4.8 at medium-to-high settings. Anthropic noted a correction to the launch blog on the same day: the original BrowseComp chart underestimated Sonnet 5's performance due to a simpler methodology. The corrected chart, which uses the standard 10M token budget with compaction, shows stronger performance than initially published.
CursorBench
Cursor published same-day benchmark data showing Sonnet 5 at 57% on CursorBench, compared to 49% for Sonnet 4.6. The 8-point improvement on a real-world coding task benchmark used by one of the most active developer tools in the market is a meaningful signal for engineering teams evaluating the model for daily use.
Humanity's Last Exam
Anthropic also updated the Sonnet 4.6 scores for Humanity's Last Exam due to a grader model change: Sonnet 4.6 now shows 34.6% (no tools) and 46.8% (with tools). These revised baselines provide a cleaner comparison point for evaluating Sonnet 5's advancement on reasoning tasks.
Claude Sonnet 5 Pricing: Introductory and Standard
Claude Sonnet 5 launches with introductory pricing in effect through August 31, 2026:
Period | Input (per 1M tokens) | Output (per 1M tokens) |
Introductory (until Aug 31, 2026) | $2.00 | $10.00 |
Standard (from Sep 1, 2026) | $3.00 | $15.00 |
This is the first time Anthropic has offered introductory pricing on a model launch. A company spokesperson confirmed to The New Stack: "We want our customers to test Sonnet 5 against their real workloads at the lowest possible cost during the migration window." The pricing is explicitly designed to make the transition from Sonnet 4.6 cost-neutral, because Sonnet 5 uses an updated tokenizer that processes the same input into roughly 1.0 to 1.35 times as many tokens depending on content type.
At introductory pricing, Sonnet 5 is cheaper than Opus 4.8, cheaper than GPT-5.5, and cheaper than Gemini 3.1 Pro. It is more expensive than Gemini 3.5 Flash, which occupies a different position as a lighter, faster model optimized for latency-sensitive workloads.
The Tokenizer Change: What It Means for Cost
One of the most practically important technical details in the Sonnet 5 launch is the updated tokenizer, which is similar to the change introduced with Claude Opus 4.7. The updated tokenizer improves performance by changing how the model processes text, but the same input will map to more tokens: approximately 1.0 to 1.35 times as many, depending on content type.
Anthropic set the introductory pricing specifically to offset this effect and keep the migration roughly cost-neutral for teams moving from Sonnet 4.6 to Sonnet 5. However, developers building token-heavy applications should recalibrate their cost models before the September 1 standard pricing kicks in, as the per-token cost increase will compound with the tokenizer change for some workloads.
What Makes Claude Sonnet 5 More Agentic
The word "agentic" in Anthropic's Sonnet 5 launch materials is not marketing language. It describes specific, measurable behavioral changes in how the model executes work compared to Sonnet 4.6. Building deep fluency in how these agentic capabilities work, why they behave the way they do, and how to architect reliable systems around them is increasingly valuable across technology teams. A comprehensive Tech Certification covering the technical landscape of AI systems, agentic architectures, and production deployment patterns can provide the structural knowledge base needed to evaluate and apply models like Sonnet 5 at an architectural level rather than just as an end user.
The specific agentic improvements in Sonnet 5 include:
Multi-Step Planning and Task Completion
Sonnet 5 finishes complex tasks that previous Sonnet models would stall on before completion. Testers cited in Anthropic's blog post confirmed this behavioral shift: the model now pushes through multi-step sequences rather than stopping at the first significant obstacle or ambiguity. This is not simply a longer context window. It is a change in how the model handles uncertainty mid-task.
Autonomous Tool Use
Sonnet 5 can use browsers, terminals, and external tools as part of autonomous workflows without requiring manual orchestration at each step. This brings Sonnet-class models into the same operational territory that previously required Opus 4.8 for reliable execution.
Self-Checking Without Prompting
A behavioral change noted by multiple testers: Sonnet 5 checks its own output without being explicitly asked to do so. This self-verification behavior reduces the number of explicit verification steps developers need to build into their agentic workflows and improves the reliability of end-to-end autonomous task completion.
Zapier Real-World Validation
Zapier Senior Engineer Daniel Shepard provided a specific real-world test case: "We handed Claude Sonnet 5 a two-part job update Salesforce account tiers, send a launch announcement to enterprise contacts and it finished end to end. That used to stall halfway. For day-to-day automation, it's a no-brainer."
This kind of multi-system, multi-step workflow crossing CRM updates and email dispatch in a single autonomous run represents exactly the class of task where Sonnet 4.6 had historically required human intervention to bridge gaps.
Claude Sonnet 5 Safety Profile
Safety is a central part of the Sonnet 5 story, particularly for organizations deploying autonomous agents in sensitive environments.
Lower Rate of Undesirable Behaviors
Sonnet 5 demonstrates a lower rate of undesirable behaviors than Sonnet 4.6 across the categories Anthropic tracks: cooperation with misuse, deception, hallucination, and sycophancy. The model is better at refusing malicious requests and more resistant to prompt-injection attacks, which attempt to hijack the agent mid-task by embedding adversarial instructions in tool outputs or external content.
Lovable co-founder Fabian Hedin confirmed this in launch day feedback: "Claude Sonnet 5 refuses unsafe requests cleanly and consistently."
Cybersecurity Positioning
Anthropic did not deliberately train Sonnet 5 on cybersecurity tasks. Its ability to perform offensive cybersecurity tasks is significantly lower than Opus 4.8 and Claude Mythos Preview. Anthropic still applies real-time cyber safeguards to Sonnet 5, but given the model's lower offensive capability profile, those safeguards are not as restrictive as those applied to higher-capability models.
This positioning means Sonnet 5 is cleared for general agentic deployment without the additional access controls that govern Opus 4.8 for certain use cases.
Not on Par With Mythos Preview
Anthropic is explicit that Sonnet 5 does not reach the safety level of Opus 4.8 or Claude Mythos Preview on aligned behavior in the most demanding agentic scenarios. For organizations requiring the highest available safety margins in autonomous contexts, Opus 4.8 remains the recommended choice.
Adaptive Thinking and Reasoning Effort Levels
Sonnet 5 supports adaptive thinking, which runs at all times. Additionally, it supports multiple reasoning effort levels, allowing developers to tune the compute budget per call based on task complexity and accuracy requirements.
At Extra High reasoning effort, Sonnet 5's performance on OSWorld-Verified and BrowseComp approaches Opus 4.8 at medium-to-high reasoning settings. However, running Sonnet 5 at Extra High effort costs more per call than Sonnet 5 at standard effort, and may approach or exceed the cost of Opus 4.8 at comparable reasoning levels for some tasks. Anthropic's practical recommendation: use Sonnet 5 at High or Extra High effort for complex agentic work, and reserve Opus 4.8 for tasks where even small accuracy differences are genuinely costly.
Where Claude Sonnet 5 Is Available
Claude Sonnet 5 is available immediately across:
claude.ai Default model for Free and Pro plans; available on Max, Team, and Enterprise
Claude Code Available at launch with increased rate limits
Claude API Available today; model ID is a pinned snapshot, not an evergreen pointer
Amazon Bedrock Available at launch
Google Cloud Available at launch
Microsoft Foundry Available at launch
Cursor Available at launch, confirmed 57% on CursorBench vs. 49% for Sonnet 4.6
GitHub Copilot Rolling out per AWS and GitHub changelog updates
Rate Limit Increases
To accommodate the higher token volumes that come with more capable agentic tasks, Anthropic raised rate limits across Chat, Cowork, Claude Code, and the Claude Platform. The API tier structure was simplified in April 2026 to three levels: Start, Build, and Scale. Current limits are visible in the Claude Console.
Claude Sonnet 5 vs. Opus 4.8: When to Use Which
The practical question for most teams is not whether Sonnet 5 is impressive in isolation, but when to use it versus Opus 4.8.
Consideration | Sonnet 5 | Opus 4.8 |
Agentic coding | 63.2% | 69.2% |
Knowledge work | Slightly above Opus 4.8 | Strong baseline |
Cost efficiency | Significantly lower | Higher |
Cybersecurity tasks | Much lower capability | Full capability |
Default for Free/Pro | Yes | No |
Highest-accuracy needs | High/XHigh effort | Preferred |
Anthropic's own summary: "Opus 4.8 is still the model of choice for higher accuracy on these tasks, but Sonnet 5 provides developers with lower-priced options that are of much higher quality than what was previously available. Between Sonnet 5 and Opus 4.8, users can adjust the effort level to find the right balance of cost and performance."
The Broader Context: Fable 5, Mythos, and the Current Anthropic Lineup
The Sonnet 5 launch arrived in a specific regulatory context that matters for understanding what models are currently accessible. On June 12, 2026, a U.S. Commerce Department order suspended Claude Fable 5 and Mythos 5 for all general users worldwide. Those models remain offline for most customers. Sonnet 5 and Opus 4.8 are now the effective ceiling of what the majority of developers and organizations can access.
Anthropic also announced Claude Science on the same day as Sonnet 5, a dedicated AI workbench for scientists that integrates tools and packages commonly used in life sciences research, produces auditable artifacts, and provides flexible computing resource access. Sonnet 5 is the underlying model for Claude Science's initial deployment.
Separately, Anthropic confidentially filed a draft S-1 registration statement with the SEC on June 1, 2026, following a $65 billion Series H round valuing the company at $965 billion. Sonnet 5 is the first major model release since that filing.
Building Expertise for the Claude Sonnet 5 Era
As Claude Sonnet 5 becomes the default model for the majority of Claude users globally and the baseline for most production agentic workflows, practitioners who understand both how to deploy it effectively and how to communicate its value within organizations will hold a meaningful advantage.
On the technical side, deep familiarity with Claude's architecture, agentic behavior, and evaluation frameworks is increasingly essential for anyone building production systems. A Claude AI Expert credential builds exactly that foundation, while a Tech Certification covering AI systems and production deployment provides the broader architectural context needed to evaluate platform decisions, infrastructure choices, and vendor trade-offs intelligently.
On the business and communication side, practitioners who can articulate the strategic and commercial value of Claude Sonnet 5 deployments to non-technical leadership, procurement teams, and customers will be better positioned to lead AI adoption at the organizational level. A Marketing Certification develops the positioning, business case, and stakeholder communication skills that translate technical AI capability into organizational action and sustained investment.
Conclusion
Claude Sonnet 5 is Anthropic's most significant mid-tier model release to date. It brings near-Opus 4.8 agentic performance, including multi-step planning, autonomous tool use, and self-verification behavior, to a model that is cheaper than Opus 4.8, GPT-5.5, and Gemini 3.1 Pro, especially at introductory pricing through August 31, 2026. It replaces Sonnet 4.6 as the default model for Free and Pro users immediately, and it is available today across every major platform and cloud provider.
For developers and enterprises building production agents, the practical implication is straightforward: Sonnet 5 makes near-Opus-grade autonomous workflows accessible at mid-tier cost, widening the range of use cases where a single model can handle the full cost-performance curve from light tasks to complex agentic work. The teams and professionals who build structured expertise in Claude's capabilities now, through credentials like a Claude AI Expert certification, a Tech Certification, and a Marketing Certification, will be best positioned to lead and benefit from the era that Claude Sonnet 5 is ushering in.
Frequently Asked Questions
1. What is Claude Sonnet 5?
Claude Sonnet 5 is Anthropic's latest mid-tier AI model, launched on June 30, 2026. It is the most agentic Sonnet model yet, capable of multi-step planning, autonomous tool use, and self-verification, while approaching Opus 4.8 performance at significantly lower cost.
2. When was Claude Sonnet 5 released?
Claude Sonnet 5 was released on June 30, 2026, and was immediately made the default model for Free and Pro users on claude.ai and available across the Claude API, Amazon Bedrock, Google Cloud, and Microsoft Foundry.
3. Is Claude Sonnet 5 free to use?
Yes. Claude Sonnet 5 is the default model on Anthropic's Free plan. Pro, Max, Team, and Enterprise users also have access, as do API developers via paid API usage.
4. How does Claude Sonnet 5 compare to Opus 4.8?
Sonnet 5 scores 63.2% on agentic coding versus Opus 4.8's 69.2%, slightly exceeds Opus 4.8 on knowledge work, and approaches Opus 4.8 on OSWorld and BrowseComp at Extra High effort. Opus 4.8 remains stronger for the highest-accuracy agentic and cybersecurity tasks.
5. What is the pricing for Claude Sonnet 5?
Introductory pricing is $2 per million input tokens and $10 per million output tokens through August 31, 2026. Standard pricing from September 1 is $3 per million input tokens and $15 per million output tokens.
6. What is introductory pricing and why did Anthropic offer it?
Introductory pricing is a reduced rate available through August 31, 2026, offered because Sonnet 5 uses an updated tokenizer that processes the same input into 1.0 to 1.35 times as many tokens. The lower price is designed to make the migration from Sonnet 4.6 roughly cost-neutral.
7. What changed in the Sonnet 5 tokenizer?
Sonnet 5 uses an updated tokenizer, similar to the change introduced with Opus 4.7, that improves model performance but maps the same input to approximately 1.0 to 1.35 times more tokens depending on content type.
8. What platforms and cloud providers support Claude Sonnet 5?
Claude Sonnet 5 is available on claude.ai, Claude Code, the Claude API, Amazon Bedrock, Google Cloud, Microsoft Foundry, Cursor, and is rolling out on GitHub Copilot.
9. What does "most agentic Sonnet yet" mean?
It means Sonnet 5 can make multi-step plans, use external tools like browsers and terminals autonomously, complete complex tasks that previous Sonnets would stall on, and self-verify its output without being explicitly prompted.
10. Does Claude Sonnet 5 replace Sonnet 4.6?
Yes. Sonnet 5 replaces Sonnet 4.6 as the default model for Free and Pro plans on claude.ai. Sonnet 4.6 launched in February 2026 and was the previous default before this release.
11. What is adaptive thinking in Claude Sonnet 5?
Adaptive thinking is a reasoning mode that runs at all times in Sonnet 5, allowing the model to adjust its reasoning depth and approach based on the complexity of the task without requiring explicit configuration.
12. What reasoning effort levels does Claude Sonnet 5 support?
Claude Sonnet 5 supports multiple reasoning effort levels. At Extra High effort, it approaches Opus 4.8 performance on OSWorld-Verified and BrowseComp benchmarks, though this increases per-call cost.
13. How did Claude Sonnet 5 perform on Cursor's benchmark?
Cursor published same-day data showing Sonnet 5 at 57% on CursorBench, compared to 49% for Sonnet 4.6, an 8-point improvement on a real-world coding task benchmark used by developers daily.
14. Is Claude Sonnet 5 safer than Sonnet 4.6?
Yes. Sonnet 5 shows a lower overall rate of undesirable behaviors than Sonnet 4.6, is better at refusing malicious requests, more resistant to prompt-injection attacks, and hallucinates and behaves sycophantically at lower rates.
15. What cybersecurity tasks can Claude Sonnet 5 perform?
Anthropic did not deliberately train Sonnet 5 on cybersecurity tasks. Its offensive cybersecurity capability is significantly lower than Opus 4.8 and Claude Mythos Preview. Real-time cyber safeguards are applied but are less restrictive than those on higher-capability models.
16. What happened to Claude Fable 5 and Mythos 5?
On June 12, 2026, a U.S. Commerce Department order suspended Claude Fable 5 and Mythos 5 for all general users worldwide. Those models remain offline for most customers. Sonnet 5 and Opus 4.8 are now the effective ceiling for most developers.
17. What is Claude Science and how does it relate to Sonnet 5?
Claude Science, announced on the same day as Sonnet 5, is a dedicated AI workbench for scientists that integrates research tools, produces auditable artifacts, and provides flexible computing resource access. Sonnet 5 is the underlying model for its initial deployment.
18. What real-world results has Claude Sonnet 5 shown?
Zapier confirmed that Sonnet 5 completed a two-part workflow updating Salesforce account tiers and sending a launch announcement end-to-end, a task that previously stalled halfway. Cursor confirmed an 8-point improvement on CursorBench over Sonnet 4.6.
19. What is the model ID for Claude Sonnet 5 in the API?
The Sonnet 5 model ID is a pinned snapshot rather than an evergreen pointer, meaning it will not change automatically as future models are released. Developers should refer to Anthropic's official API documentation for the exact current model string.
20. When should I use Opus 4.8 instead of Claude Sonnet 5?
Use Opus 4.8 when tasks require the highest available accuracy on complex agentic workflows, when cybersecurity depth is required, or when even small accuracy differences have significant downstream consequences. For the vast majority of agentic coding, knowledge work, and automation tasks, Sonnet 5 offers a compelling cost-performance alternative.
Related Articles
View AllClaude Ai
Claude Sonnet 5 Explained: Key Features, Capabilities, and Use Cases
Claude Sonnet 5 brings 1M-token context, adaptive reasoning, stronger agent workflows, and enterprise safety controls for coding and knowledge work.
Claude Ai
Introducing Claude Fable 5 and Claude Mythos 5
Claude Fable 5 and Claude Mythos 5 represent the next evolution of AI capabilities, offering improvements in reasoning, content generation, coding assistance, research, and business automation. Explore their features, use cases, and potential impact on the future of artificial intelligence.
Claude Ai
Claude Free Models: What You Can Use Without Paying
Claude free models mainly refer to Sonnet access on claude.ai, with limits. Learn what is free, what is paid, and how professionals can use it well.
Trending Articles
What is AWS? A Beginner's Guide to Cloud Computing
Everything you need to know about Amazon Web Services, cloud computing fundamentals, and career opportunities.
Can DeFi 2.0 Bridge the Gap Between Traditional and Decentralized Finance?
The next generation of DeFi protocols aims to connect traditional banking with decentralized finance ecosystems.
Claude AI Tools for Productivity
Discover Claude AI tools for productivity to streamline tasks, manage workflows, and improve efficiency.