Elon Musk just released an AI that’s smarter than ChatGPT — here’s why that matters

February 19, 2025

81

Table of Contents

Elon Musk’s synthetic intelligence startup xAI has unveiled Grok 3, its newest AI mannequin that the corporate claims outperforms main opponents throughout key technical benchmarks. The announcement marks a major escalation within the race to develop extra highly effective AI methods.

The launch comes simply days after Musk’s failed $97.4 billion bid to accumulate OpenAI, the corporate he co-founded with Sam Altman in 2015. Throughout a livestreamed demonstration on X, Musk characterised Grok 3 as “an order of magnitude extra succesful than Grok 2” and emphasised its capacity to cause by means of advanced issues.

Early testing seems to help a few of xAI’s claims. The mannequin topped the influential Chatbot Area leaderboard, scoring greater than OpenAI’s GPT-4o, Google’s Gemini and DeepSeek’s V3 mannequin in blind person testing. Revealed benchmarks present Grok 3 attaining superior scores in arithmetic (AIME ’24), scientific reasoning (GPQA) and coding duties.

Grok 3 leads the Chatbot Area leaderboard with a rating of roughly 1400, considerably outperforming different main AI fashions in blind person testing. (Supply: xAI)

Inside Grok 3’s huge computing infrastructure: 200,000 GPUs and a brand new knowledge heart

“Grok 3 clearly has round cutting-edge considering capabilities,” wrote former OpenAI researcher Andrej Karpathy in an X put up after early-access testing. “Few fashions get this proper reliably. The highest OpenAI considering fashions get it too, however all of DeepSeek-R1, Gemini 2.0 Flash Pondering, and Claude don’t.”

The mannequin’s growth required huge computational sources. xAI doubled its GPU cluster to 200,000 Nvidia chips for coaching, housed in a brand new Memphis knowledge heart. This infrastructure funding highlights the growing computational calls for of superior AI growth, as firms race to construct extra succesful methods.

I used to be given early entry to Grok 3 earlier at the moment, making me I believe one of many first few who may run a fast vibe test.

Pondering
✅ First, Grok 3 clearly has an round cutting-edge considering mannequin (“Suppose” button) and did nice out of the field on my Settler’s of Catan… pic.twitter.com/qIrUAN1IfD

— Andrej Karpathy (@karpathy) February 18, 2025

DeepSearch and superior reasoning: how Grok 3 goals to outsmart ChatGPT and Google Gemini

A key innovation is Grok 3’s “DeepSearch” characteristic, which mixes net looking out with reasoning capabilities to investigate info from a number of sources. The system additionally contains specialised modes for advanced problem-solving, together with a “Suppose” perform that reveals its reasoning course of and a “Huge Mind” mode that allocates extra computing energy to troublesome duties.

“The factor to actually take note of in AI is studying pace. And @xai is studying method sooner than every other,” posted tech business veteran Robert Scoble, citing a dialog with Apple Siri cofounder Tom Gruber.

Grok 3 benchmarks.

The factor to actually take note of in AI is studying pace. And @xai is studying method sooner than every other.

Who stated that?

Apple Siri cofounder Tom Gruber. He instructed me at dinner a decade in the past that that’s an important factor to concentrate to. pic.twitter.com/yWCiJsN9pU

— Robert Scoble (@Scobleizer) February 18, 2025

Nevertheless, some limitations emerged throughout testing. Karpathy famous that the mannequin generally fabricates citations and struggles with sure forms of humor and moral reasoning duties. These challenges are frequent throughout present AI methods and spotlight the continued difficulties in creating actually human-like synthetic intelligence.

Scale.ai CEO Alexandr Wang praised the discharge, tweeting: “Grok 3 is a brand new greatest mannequin on this planet from the @xai group!” He famous its superior efficiency on numerous benchmarks and expressed enthusiasm for future collaboration.

Grok 3 is a brand new greatest mannequin on this planet from the @xai group!

Grok 3 ranks #1 on Chatbot Area w/a giant hole, and scores impressively on pretraining and reasoning evals.

congrats to @elonmusk @ibab @jimmybajimmyba @Yuhu_ai_

trying ahead to extra partnership on grok4 & past ? pic.twitter.com/BrPGz17P51

— Alexandr Wang (@alexandr_wang) February 18, 2025

AI business competitors heats up: what Grok 3’s launch means for OpenAI, DeepSeek and the way forward for synthetic intelligence

The mannequin shall be accessible by means of X’s Premium+ subscription ($40/month) and a brand new standalone “SuperGrok” service ($30/month). Enterprise API entry is deliberate for the approaching weeks.

This launch intensifies competitors within the AI business, notably as Chinese language startup DeepSeek lately demonstrated comparable efficiency with reportedly decrease computational necessities. The event additionally raises questions in regards to the sustainability of the computational arms race in AI, as firms make investments billions in more and more highly effective {hardware} infrastructure.

In key efficiency benchmarks, Grok 3 and its mini variant present superior scores throughout arithmetic, science and coding exams in comparison with competing fashions from Google, OpenAI, Anthropic and DeepSeek. The total-size Grok 3 mannequin (darkish blue) achieved notably robust leads to scientific reasoning. (Supply: xAI)

Musk emphasised that Grok 3 stays in beta, with enhancements anticipated “nearly each day.” The corporate plans so as to add voice interplay capabilities inside weeks and can open-source its earlier mannequin, Grok 2, as soon as the brand new model stabilizes.

But maybe probably the most telling side of Grok 3’s debut isn’t its technical specs or benchmark scores, however what it represents: the mounting rigidity between Musk and his former colleagues at OpenAI. Simply days after his failed $97.4 billion bid to accumulate OpenAI, Musk has unveiled a mannequin that challenges its supremacy — suggesting that within the high-stakes race for AI dominance, even a rejected suitor can change into a formidable rival.

Supply hyperlink

Tags
AI
AI News

Buy now

Elon Musk just released an AI that’s smarter than ChatGPT — here’s why that matters

Inside Grok 3’s huge computing infrastructure: 200,000 GPUs and a brand new knowledge heart

DeepSearch and superior reasoning: how Grok 3 goals to outsmart ChatGPT and Google Gemini

AI business competitors heats up: what Grok 3’s launch means for OpenAI, DeepSeek and the way forward for synthetic intelligence

Related Articles

Bose QuietComfort Ultra vs. Sony WH-1000XM6: I tried the two best...

Hiring specialists made sense before AI — now generalists win

Top 10 AI Models For Web Development in 2025

Leave a Reply Cancel reply

Latest Articles

Bose QuietComfort Ultra vs. Sony WH-1000XM6: I tried the two best...

Hiring specialists made sense before AI — now generalists win

Top 10 AI Models For Web Development in 2025

‘ONE RULE’: Trump says he’ll sign an executive order blocking state...

Anthropic and Accenture sign multi-year AI strategic partnership