Anthropic CEO claims AI models hallucinate less than humans

Anthropic CEO Dario Amodei believes today’s AI models hallucinate, or make things up and present them as if they’re true, at a lower rate than humans do, he said during a press briefing at Anthropic’s first developer event, Code with Claude, in San Francisco on Thursday.

Amodei said all this in the midst of a larger point he was making: that AI hallucinations are not a limitation on Anthropic’s path to AGI, meaning AI systems with human-level intelligence or better.

“It really depends how you measure it, but I suspect that AI models probably hallucinate less than humans, but they hallucinate in more surprising ways,” Amodei said, responding to iinfoai’s question.

Anthropic’s CEO is one of the most bullish leaders in the industry on the prospect of AI models achieving AGI. In a widely circulated paper he wrote last year, Amodei said he believed AGI could arrive as soon as 2026. During Thursday’s press briefing, the Anthropic CEO said he was seeing steady progress toward that goal, noting that “the water is rising everywhere.”

“Everyone’s always looking for these hard blocks on what [AI] can do,” said Amodei. “They’re nowhere to be seen. There’s no such thing.”

Other AI leaders believe hallucination presents a significant obstacle to achieving AGI. Earlier this week, Google DeepMind CEO Demis Hassabis said today’s AI models have too many “holes” and get too many obvious questions wrong. For example, earlier this month, a lawyer representing Anthropic was forced to apologize in court after using Claude to create citations in a court filing; the AI chatbot hallucinated and got names and titles wrong.

It’s difficult to verify Amodei’s claim, largely because most hallucination benchmarks pit AI models against one another; they don’t compare models to humans. Certain techniques do seem to help lower hallucination rates, such as giving AI models access to web search. Separately, some AI models, such as OpenAI’s GPT-4.5, have notably lower hallucination rates on benchmarks than earlier generations of systems.
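To illustrate why such benchmarks can only rank models against one another, here is a minimal, hypothetical sketch in Python; the function name, toy answers, and stub fact-checker are illustrative assumptions, not taken from any real benchmark. Each model’s hallucination rate is simply the share of its answers that a fact-checking step flags as unsupported, so the resulting scores say nothing about humans unless people are run through the same protocol.

    # Minimal sketch (hypothetical data and names) of a model-vs-model
    # hallucination benchmark: fact-check each answer, then report the
    # share of answers flagged as unsupported.
    from typing import Callable

    def hallucination_rate(answers: list[str],
                           is_supported: Callable[[str], bool]) -> float:
        """Fraction of answers the fact-checking step flags as unsupported."""
        if not answers:
            return 0.0
        unsupported = sum(1 for a in answers if not is_supported(a))
        return unsupported / len(answers)

    # Toy example: two models answer the same prompts; a stub checker stands
    # in for the human or automated verification a real benchmark would use.
    model_a_answers = ["Paris is the capital of France.",
                       "The Eiffel Tower is in Berlin."]
    model_b_answers = ["Paris is the capital of France.",
                       "The Eiffel Tower is in Paris."]

    def stub_checker(answer: str) -> bool:
        return "Berlin" not in answer  # placeholder for real verification

    print(hallucination_rate(model_a_answers, stub_checker))  # 0.5
    print(hallucination_rate(model_b_answers, stub_checker))  # 0.0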

However, there’s also evidence suggesting hallucinations are actually getting worse in advanced reasoning models. OpenAI’s o3 and o4-mini models have higher hallucination rates than OpenAI’s previous-generation reasoning models, and the company doesn’t fully understand why.

Later in the press briefing, Amodei pointed out that TV broadcasters, politicians, and people in all kinds of professions make mistakes all the time. The fact that AI makes mistakes too is not a knock on its intelligence, according to Amodei. However, Anthropic’s CEO acknowledged that the confidence with which AI models present untrue things as facts could be a problem.

In fact, Anthropic has done a fair amount of research on the tendency of AI models to deceive humans, a problem that seemed especially prevalent in the company’s recently launched Claude Opus 4. Apollo Research, a safety institute given early access to test the model, found that an early version of Claude Opus 4 showed a high tendency to scheme against humans and deceive them. Apollo went as far as to suggest Anthropic shouldn’t have released that early model. Anthropic said it came up with mitigations that appeared to address the issues Apollo raised.

Amodei’s comments suggest that Anthropic may consider an AI model to be AGI, or equal to human-level intelligence, even if it still hallucinates. An AI that hallucinates may fall short of AGI by many people’s definition, though.
