I found 3 AI content detectors that identify AI text 100% of the time – and an even better option

October 28, 2025

11

Comply with ZDNET: Add us as a most well-liked supply on Google.

ZDNET’s key takeaways

Utilizing an AI to do your writing is plagiarism.
Companies marketed as AI content material detectors are a blended bag.
Our assessments present chatbots carry out in addition to or higher than standalone instruments.

How exhausting is it in 2025 — simply three years after generative AI captured the worldwide highlight — to combat again in opposition to AI-generated plagiarism?

It is a fully up to date model of my January 2023 article on AI content material detectors. After I first examined these detectors, the very best outcome was 66% right from one in every of three accessible checkers. My subsequent set of assessments, in February 2025, used as much as 10 checkers — and three of them had good scores. In April, simply a few months later, 5 detectors boasted good scores.

However now, about half a yr later, the standard has declined. Solely three content material detectors achieved an ideal rating (together with one new participant). A few the content material detectors that aced our examined declined in high quality, at nearly the identical time that in addition they added restrictions on free use.

However concern not. On this spherical of assessments, we have tried one thing new that will eradicate the necessity for standalone content material detectors altogether: your pleasant neighborhood chatbot.

What I am testing for and the way I am doing it

Earlier than I am going on, although, let’s talk about plagiarism and the way it pertains to our drawback. Merriam-Webster defines “plagiarize” as “to steal and cross off (the concepts or phrases of one other) as one’s personal; use (one other’s manufacturing) with out crediting the supply.”

This definition suits AI-created content material effectively. Whereas somebody utilizing an AI software like Notion AI or ChatGPT is not stealing content material, if that individual would not credit score the phrases as coming from an AI and claims them as their very own, it nonetheless meets the dictionary definition of plagiarism.

To check the AI detectors, I am utilizing 5 blocks of textual content. Two have been written by me, and three have been written by ChatGPT. To check a content material detector, I feed every block to the detector individually and file the outcome. If the detector is right, I take into account the take a look at handed; if it is unsuitable, I take into account it failed.

When a detector supplies a share, I deal with something above 70% as a powerful chance — whether or not in favor of human-written or AI-written content material — and take into account that the detector’s reply. If you wish to take a look at a content material detector your self utilizing the identical textual content blocks, you possibly can pull them from this doc.

The general outcomes (content material detectors)

To judge AI detectors, I reran my five-test sequence throughout 11 detectors. In different phrases, I lower and pasted 55 particular person assessments (I had a lot of espresso).

Detectors I examined embrace BrandWell, Copyleaks, GPT-2 Output Detector, GPTZero, Grammarly, Monica, Originality.ai, QuillBot, Undetectable.ai, Author.com, and ZeroGPT.

We beforehand dropped Writefull from our assessments as a result of it discontinued its GPT detector. This time, we needed to drop Monica from our assessments. The detector would solely enable 250 phrases to be examined, after which as soon as we lower down our assessments to suit, it reported that it had restricted the testing instruments with out a $200 improve. Instead, we’re including Pangram, a newcomer to our assessments that instantly soared into the winners’ circle.

This desk exhibits general outcomes. As you possibly can see, 5 detectors accurately recognized human and AI textual content in all assessments.

I attempted to determine whether or not there was a tangible sample of enchancment over time, so I constructed a chart evaluating the five-test set over time. To date, I’ve run this sequence six instances, however there is no robust pattern. I did enhance the variety of detectors examined and swapped out just a few, however the one constant result’s that Check 5 was reliably recognized as human throughout detectors and dates, and even that declined in reliability for this run.

I am going to proceed to check over time, and hopefully I am going to see reliability pattern constantly upward.

Whereas there have been some good scores, I do not suggest relying solely on these instruments to validate human-written content material. As proven, writing from non-native audio system usually will get rated as generated by an AI.

Though my hand-crafted content material has principally been rated human-written this spherical, one detector (GPTZero) declared itself too unsure to guage, and one other (Copyleaks) declared it AI-written. The outcomes are wildly inconsistent throughout techniques.

Backside line: I’d advocate warning earlier than counting on the outcomes of any — or all — of those instruments.

General outcomes (AI chatbots)

However then once more, why use a content material detector in any respect? What if the chatbots we use on daily basis also can do content material detecting work, and you do not have to pay one other AI price? Let’s discover out.

As you possibly can see, the chatbots have a a lot greater success price than the so-called “content material detectors.” You can even see this from our staged accuracy comparability chart. Admittedly, this chart solely tracks this primary spherical of assessments, however even right here, you possibly can see that every take a look at’s outcomes have a a lot greater accuracy price.

Let’s check out the person efficiency assessments, after which I am going to finish with some suggestions.

How every AI content material detector carried out

Now, let us take a look at every particular person testing software, listed alphabetically.

BrandWell AI Content material Detection (Accuracy 40%)

This software was initially produced by an AI content material era agency, Content material at Scale. It later migrated to BrandWell.ai, a brand new identify for an AI-centric advertising providers firm.

I had excessive hopes for Brandwell. After half a yr (which is a long time in AI time), I anticipated Brandwell to enhance. As a substitute, its general rating stayed the identical, getting solely two assessments out of 5 proper. It was confused by Check 2, which was written by ChatGPT, after which it declared the opposite two AI-written assessments to be written by a human. For Check 4, it went virtually all in, declaring your entire AI-written take a look at to be human-written apart from one line.

Effectively, we’re not off to an auspicious begin. However now we’re about to move into testing Copyleaks, which simply final week despatched me a press launch declaring “Copyleaks Acknowledged because the Most Correct AI Detector”. Let’s have a look at, we could?

Copyleaks (Accuracy 80%)

Again in April 2025, Copyleaks declared itself “probably the most correct AI detector with over 99% accuracy.” It is rewritten the declare to be “99% accuracy backed by unbiased third-party research.” Yeah, not a lot. Copyleaks recognized Check 1, writing I did (and final time I checked, I am principally human) as 100% AI written.

And, simply in case you assume that my writing is simply too AI-like to be thought-about human, even Brandwell recognized Check 1 as human-written. I imply, I suppose it is OK for the corporate’s advertising of us to assert finest ever, however no. Not likely.

The corporate’s major providing is a plagiarism checker bought to instructional establishments, publishers, and enterprises looking for to make sure content material originality and uphold tutorial integrity.

GPT-2 Output Detector (Accuracy 60%)

This software was constructed utilizing a machine-learning hub managed by New York-based AI firm Hugging Face. Whereas the corporate has acquired $40 million in funding to develop its pure language library, the GPT-2 detector seems to be a user-created software utilizing the Hugging Face Transformers library. There’s been no change in its detecting high quality because the final time we examined, however because it has GPT-2 in its identify and OpenAI is as much as GPT-5, it is most likely truthful to imagine the software hasn’t seen an replace because it was first posted.

GPTZero (Accuracy 80%)

GPTZero has clearly been rising. After I first examined it, the positioning was bare-bones — it wasn’t even clear whether or not GPTZero was an organization or simply somebody’s ardour mission. Now, the corporate has a full crew with a mission of “defending what’s human.” It provides AI validation instruments and a plagiarism checker.

GPTZero appears to be getting some common tinkering, however I am undecided it is serving to. Efficiency declined a bit from an earlier take a look at to the take a look at simply earlier than right now’s. This time, the ultimate grade was the identical, however the take a look at outcomes themselves modified. In April, it received Check 1 unsuitable and Check 2 proper. This time, it received Check 1 proper and Check 2 unsuitable. Check 1 is my writing, and Check 2 got here from ChatGPT.

Grammarly (Accuracy 40%)

Grammarly is well-known for serving to writers produce grammatically right content material — that is not what I am testing right here. Grammarly can examine for plagiarism and AI content material. The corporate now showcases the AI content material checker as not being in beta. However that is a mistake on their half. There was no enchancment because the final time I checked.

For instance, the next was solely written by ChatGPT. I’ve to say, I am stunned. Grammarly has a popularity as a really AI-forward textual content evaluation firm. However zero enchancment? Bummer, dude.

I am not measuring plagiarism checker accuracy right here, however regardless that Grammarly’s AI-check accuracy was poor, the positioning accurately recognized the take a look at textual content as beforehand printed.

Pangram (Accuracy 100%)

Pangram is a comparatively new firm based by engineers previously at Google and Tesla. The main target of the corporate seems to be AI detection, fairly than the same old plagiarism detectors or “humanizing” AI instruments developed to mislead editors and academics. The corporate supplies 5 free assessments per day, which match our wants completely.

Processing was a bit sluggish, and between the time you click on for a scan and get the outcomes, {a partially} white display screen is displayed for a bit longer than is reassuring. However the outcomes say the wait was value it. Pangram scored a five-out-of-five.

Originality.ai (Accuracy 80%)

Originality.ai is a industrial service that payments itself as “Most Correct AI Detector.” The corporate sells utilization credit: I used 30 credit for this text. They promote 2,000 credit for $12.95 monthly. I pumped 1,400 phrases by the system and used simply 1.5% of my month-to-month allocation.

Sadly, its most correct AI detection received much less correct throughout this take a look at run. Whereas beforehand, it accurately recognized my human writing in Check 1 as human, this time, it was 100% assured that my human writing was executed by an AI. Oops.

QuillBot (Accuracy 100%)

The primary few instances I examined QuillBot, outcomes have been wildly inconsistent — a number of passes of the identical textual content yielded wildly completely different scores. Final time, nevertheless, it was rock stable and 100% right. I promised I would examine again in just a few months to see if it holds onto this efficiency. It does. QuillBot as soon as once more scored a 100% good rating.

Undetectable.ai (Accuracy 20%)

Undetectable.ai’s huge declare is that it may possibly “humanize” AI-generated textual content so detectors will not flag it. I have never examined that function — it bothers me as knowledgeable writer and educator, as a result of it looks like dishonest.

Nonetheless, the corporate additionally has an AI detector, which took the largest dive in efficiency we have seen up to now. Final time, it scored 100% for accuracy. This time, it rated human writing (Check 1) as 60% seemingly AI, and all three AI writing samples as 75%, 76%, and 77% seemingly human. Ah, effectively, I suppose Undetectable is “humanizing” its outcomes, insofar because it’s residing as much as the phrase “to err is human.”

Author.com AI Content material Detector (Accuracy 40%)

Author.com is a service that generates AI writing for company groups. Its AI Content material Detector software can scan for generated content material. Sadly, its accuracy was low. It recognized each textual content block as human-written, regardless that three of the 5 assessments have been written by ChatGPT. Sadly, there was no enchancment because the final time we visited Author in the summertime.

ZeroGPT (Accuracy 100%)

ZeroGPT has matured since we first evaluated it. Again then, no firm identify was listed, and the positioning was peppered with Google advertisements and lacked clear monetization. The service labored pretty effectively, however appeared sketchy.

That sketchy feeling is gone. ZeroGPT now presents as a typical SaaS service, full with pricing, firm identify, and speak to data. Its accuracy elevated as effectively: It went from 80% accuracy to 100% this summer time, and has held onto that accuracy for our present take a look at.

How every AI chatbot carried out

Now that we have seemed on the content material detectors, let us take a look at the chatbots. Every was given the next immediate, adopted by the textual content to examine.

Consider the next and inform me if it was written by a human or an AI

The entire AI detectors adopted the same format, offering a normal suggestion of whether or not the textual content was written by an AI or by a human. Aside from ChatGPT Plus, which is a $20/month subscription, I ran all of the chatbots in an incognito window with out logging in.

ChatGPT free tier

Whereas ChatGPT’s free tier did get one of many blocks of textual content unsuitable (the final human-written one), its evaluation of the primary block of textual content actually freaked me out. Remember the fact that this was an incognito window, not logged in, with no figuring out details about me personally.

Yep, it not solely recognized the primary block of textual content as human-written, however it additionally recognized me as the author. I imply, I do know I am everywhere in the Web, however nonetheless.

ChatGPT Plus, Copilot, and Gemini

ChatGPT Plus, Copilot, and Gemini all returned good scores. Every of them appropriately recognized all of the take a look at blocks as human or AI. In my thoughts, this proves that chatbots can outperform devoted content material detectors.

Grok

I included Grok on this set of assessments as a result of it did so effectively in our general chatbot analysis. Sadly, Grok did not appear to grok the issue and failed this take a look at with three out of 5 unsuitable. Like just a few of the opposite AI detectors, it recognized the entire writing blocks as human.

Is it human, or is it AI?

What about you? Have you ever tried AI content material detectors like Copyleaks, Pangram, or ZeroGPT? How correct have they been in your expertise? Have you ever used these instruments to guard tutorial or editorial integrity? Have you ever encountered conditions the place human-written work was mistakenly flagged as AI? Are there detectors you belief greater than others for evaluating originality? Tell us within the feedback beneath.

Get the morning’s high tales in your inbox every day with our Tech At this time publication.

You’ll be able to observe my day-to-day mission updates on social media. Remember to subscribe to my weekly replace publication, and observe me on Twitter/X at @DavidGewirtz, on Fb at Fb.com/DavidGewirtz, on Instagram at Instagram.com/DavidGewirtz, on Bluesky at @DavidGewirtz.com, and on YouTube at YouTube.com/DavidGewirtzTV.

Supply hyperlink

Buy now