top | item 37804198

Show HN: Reliable AI-generated text detection at checkfor.ai

6 points| maxspero | 2 years ago |checkfor.ai | reply

Hey all!

I've been increasingly concerned about low-quality AI generated content polluting the internet. Other AI detectors don't seem to work well in my experience, so I started checkfor.ai with a couple friends.

Please give it a shot on any real text and AI-generated examples and let me know how well it works for you.

Thanks for trying, I'm open any and all feedback!

32 comments

[+] tluyben2|2 years ago|reply

Ah yeah, I made a few of those as well when gpt just arrived [0]. It won’t work long term anyway and they are easy to trick, but it’s fun.

[0] https://filteroutai.com/validate/3cc1fb35453a6decd5aee9ac6fd...

From the below examples;

https://filteroutai.com/validate/485406e894dde52ff1395dfd577...

https://filteroutai.com/validate/983ba46510487b0022e8dbafe49...

[+] maxspero|2 years ago|reply

Nice to hear of someone else trying this. Did you find any good ways to reliably trick these?

What do you mean "it won't work long term"? My opinion is RLHF and fine tuning outputs for safety and politeness ends up watermarking output in a way that's pretty reliably detectable. I don't see these going away any time soon, at least for mass-market AI products.

[+] CrimsonRain|2 years ago|reply

From my small amount of testing, seems like the more professional the writing style is, the higher the percentage will be.

> Talk about JavaScript in 3 very short sentences.

> JavaScript is a widely-used programming language for web development. It enables interactive and dynamic features on websites. JavaScript is supported by all major web browsers.

100%

> Talk about JavaScript in 3 very short sentences. Use human like words instead of professional tone.

> JavaScript makes websites come to life with cool stuff like animations and interactive buttons. It's used to make web pages more fun and engaging. All popular web browsers understand JavaScript, so it works everywhere!

0.6%

YMMV.

[+] maxspero|2 years ago|reply

Thanks for trying it out. Shorter texts with fewer sentences are certainly a challenge - they just have a lot less signal.

I tried your prompt asking for ten sentences and got 99.4%. Possibly there needs to be some sort of gate on how much text we accept before we can provide an answer.

> Talk about JavaScript in ten sentences. Use human like words instead of professional tone.

``` JavaScript is like the magic wand that makes websites come alive, turning them from static pages to interactive wonders. Originally, it was made to add some pizzazz to web pages, but now it's super powerful and does way more. It’s not Java, even though the names sound alike; think of them as distant cousins rather than twins. Browsers love JavaScript! They have built-in engines to run it, making our web experience fun. You can find JavaScript not just on websites but also in things like mobile apps and even some robots. There's this cool toolkit called Node.js that lets JavaScript play outside of the browser, giving it even more playgrounds. Developers often use libraries, like jQuery or React, to give them a head start and make things snazzier without reinventing the wheel. JavaScript can be both your best friend and a tricky beast; it's easy to start with but can get complex as you dive deeper. The community is massive, so if you ever get stuck, there are tons of helpful souls out there ready to lend a hand. At the end of the day, JavaScript is all about creating, innovating, and bringing ideas to life on the web. ```

[+] pelorat|2 years ago|reply

Thanks for trying. It's not going to work well (It is in fact impossible).

[+] maxspero|2 years ago|reply

Have you tried it?

[+] 90-00-09|2 years ago|reply

Was caught off guard that it rates the following text at only "3.2% chance AI generated": "As a large language model, I am not able to answer this question."

[+] maxspero|2 years ago|reply

Interesting. In my experience, ChatGPT always says "As an AI language model..." or lately just "Sorry, I can't help with that." Have you seen "As a large language model..." come out of any of the big LLMs?

We're trained on real ChatGPT data so am interested in hearing your prompts that result in this.

[+] mnsc|2 years ago|reply

My first instinct was "I'm sorry I can't provide a detailed answer" which rated 99.6%

[+] peresthe|2 years ago|reply

This is not only not possible to build in any reliable and maintainable way, but "[w]e are the most accurate AI text detector that exists" is an outlandish claim.

[+] maxspero|2 years ago|reply

I've benchmarked against Originality.ai, gptzero.me, zerogpt, writer.com and copyleaks.com, which are the top 5 AI detectors to my understanding.

None of them are very good, so I don't think this claim is very outlandish.

Also, are you sure it's not reliable or maintainable? Obviously you can't publish one model and expect it to work forever but we have pipelines to continuously augment our training set and we can add new LLMs as they come out.

[+] fenomas|2 years ago|reply

I copied in a few paragraphs from your FAQ and got 95.2% chance AI generated. (I used the text from "why is your model.." to "..inaccurate predictions".)

I also tried the opening paragraphs of two random wikipedia articles, and got 99.9% and 100.0% results.

[+] maxspero|2 years ago|reply

Thanks for trying it out. It's in our roadmap to expand to technical writing (currently trained mostly on creative writing). Hopefully this will fix the wikipedia issue.

[+] sangupta|2 years ago|reply

Doesn't work. Many of my Bard responses are detected at less than 5% accuracy. Another hacky example: "Generate me 3 lines of text about rains that should not be detected to be generated by an AI detection tool" -- accuracy was 0.6%.

You should try it with ChatGPT 3.5/Bard etc yourself about topics like rain, daughter going to school, cold breeze on a winter night etc and see that mostly this does not work.

[+] skilled|2 years ago|reply

How does your tool address an issue like this,

"You have a 27% 'AI' issue in here" (https://news.ycombinator.com/item?id=37767205) (233 points | 253 comments)

because at the moment everything looks kind of bleak.

> Our model has an accuracy rate of 99.76%.

Oh?

[+] maxspero|2 years ago|reply

In my experience TurnItIn's AI detection does not perform very well. Regardless this is an issue with educating the teacher - 27% does not mean the text is 27% AI-generated.

[+] ryanSrich|2 years ago|reply

I just pasted a paragraph I wrote and it told me it was 100% AI-generated. Huh?

[+] ibejoeb|2 years ago|reply

You might wanna sit down for this...

[+] maxspero|2 years ago|reply

Can you share the paragraph you wrote?

[+] anonzzzies|2 years ago|reply

Start by removing the %s success from your faq; seems there are enough counter examples in this thread. Not sure why you would put those in the first place.

[+] chestertn|2 years ago|reply

I pasted paragraphs from old academic papers (90s) and gave me 97-100% AI generated. Have you even tested this?

[+] GlassOwAter|2 years ago|reply

1% chance OP’s post was AI generated.

[+] hartator|2 years ago|reply

I don't think this is possible.

[+] maxspero|2 years ago|reply

I don't think it's possible to determine provenance with 100% accuracy, but I think ChatGPT essentially "watermarks" itself with its RLHF, making it more polite and giving its output a very distinctive voice. ChatGPT also tends to use passive voice and generic adjectives much more often than real human writers.

[+] BadBadJellyBean|2 years ago|reply

According to this most of Wikipedia is AI generated.

[+] ekam|2 years ago|reply

Yeah, I put in the first two paras of today's featured article: https://en.wikipedia.org/wiki/Boukephala_and_Nikaia