Wednesday, August 6, 2025

Anthropic’s powerful Opus 4.1 model is here – how to access it (and why you’ll want to)

ZDNET’s key takeaways

  • Anthropic launched Claude Opus 4.1. 
  • The model exceeds its predecessor’s performance on complex tasks. 
  • It’s available to paid Claude users and through Claude Code, the API, Amazon Bedrock, and Google Cloud’s Vertex AI.

In May, Anthropic launched Claude Opus 4, which the company dubbed its most powerful model yet and the best coding model in the world. Only three months later, Anthropic is upping the ante further by launching the highly anticipated Claude Opus 4.1, which now takes its predecessor’s crown as Anthropic’s most advanced model.

The Opus family comprises the company’s most advanced, intelligent AI models, geared toward tackling complex problems. As a result, Claude Opus 4.1, released on Tuesday, excels at those tasks and can even one-up its predecessor on agentic tasks, real-world coding, and reasoning, according to Anthropic.

The model also arrives as the industry expects OpenAI to launch GPT-5 soon.

How does Claude Opus 4.1 perform?

One of the most impressive results for Claude Opus 4 was its performance on SWE-bench Verified, a human-filtered subset of SWE-bench, a benchmark that evaluates LLMs’ abilities to solve real-world software engineering tasks sourced from GitHub. Claude Opus 4’s performance on SWE-bench Verified supported the claim that it was the “best coding model in the world.” According to the figures Anthropic posted, Opus 4.1 scored even higher.

Claude Opus 4.1 also outperformed Anthropic’s previous models across the benchmark board, including MMMLU, which tests multilingual capabilities; AIME 2025, which tests rigor on high school math competition questions; GPQA, which tests performance on graduate-level reasoning prompts; and more. When pitted against competitors’ reasoning models, including OpenAI o3 and Gemini 2.5 Pro, it outperforms them on various benchmarks, including SWE-bench Verified.

With the release, Anthropic also published its system card, which delineates all the safety tests and evaluations it conducted on the model, as well as its weaknesses, risks, and limitations. A quick overview of the 22-page document reveals that the model was deployed under the AI Safety Level 3 (ASL-3) Standard of Anthropic’s Responsible Scaling Policy (RSP), and that it’s still prone to many of the same vulnerabilities.

How can you access Claude Opus 4.1?

If you want to try the model for yourself, it’s now available to everyone on the paid Claude plans, which include Claude Pro for $20 per month and Claude Max for $100 per month. It’s also available in Claude Code, the API, Amazon Bedrock, and Google Cloud’s Vertex AI.
