Unfortunately there is no way to combat this, and it seems like the end of the internet we once knew. Even with a “proof of human” technology, people could still just paste whatever AI-generated text they wanted, under their “real” account.
This has likely been going on since the first ChatGPT was released.
I am moderating an art subreddit with about 2m users and the AI „art“ spam is getting really annoying to moderate. I don’t even understand what the purpose of these accounts is.
There are ways to combat it -- LLM-generated text leaves statistical fingerprints that appear to endure across big foundation model generations.
I'm working on Binoculars with some UMD and CMU folks and wanted to test it out on this. I downloaded one bot's comment history (/u/markusrorscht). 30% of the comments rated human-like, compared to 95-100% of comments from a few human users.
So, practically speaking, statistical methods are still able to provide a fingerprinting method, and one that gets better as comment history gets longer. And they can be combined with other bot detection methods. IMO bot detection will stay a cat-and-mouse game, rather than (LLM-powered) bots winning the whole thing.
If I read a comment that has any probability of changing my mind about a fact or opinion, I always go to the user page to check their registration date. No hard cut-off date but I usually discount or ignore any account >= 2020.
FinnKuhn|10 months ago
codeduck|10 months ago
arccy|10 months ago
heyitsguay|10 months ago
I'm working on Binoculars with some UMD and CMU folks and wanted to test it out on this. I downloaded one bot's comment history (/u/markusrorscht). 30% of the comments rated human-like, compared to 95-100% of comments from a few human users.
So, practically speaking, statistical methods are still able to provide a fingerprinting method, and one that gets better as comment history gets longer. And they can be combined with other bot detection methods. IMO bot detection will stay a cat-and-mouse game, rather than (LLM-powered) bots winning the whole thing.
butlike|10 months ago
robertk|10 months ago
sureglymop|10 months ago
probably_a_gpt|10 months ago
so you prefer authority of the messenger over merit of the message?
blibble|10 months ago
hoseja|10 months ago
butlike|10 months ago
giancarlostoro|10 months ago
computerthings|10 months ago
[deleted]