(no title)
koboll | 2 years ago
Honestly, given that ChatGPT produces higher quality output, the next generation of spam blogs will (sadly) probably be an improvement over the sea of crap we have now.
koboll | 2 years ago
Honestly, given that ChatGPT produces higher quality output, the next generation of spam blogs will (sadly) probably be an improvement over the sea of crap we have now.
notahacker|2 years ago
I think that's questionable. Gibberish is easily skipped over, bargain basement ESL writing with no familiarity with the product sticks out a mile, but with GPT you get a kind of reverse "uncanny valley" effect where it's written in polished English and says just enough in the opening paragraphs that sounds informed to lead you to read half the piece before you realise it's machine generated bullshit. So you waste a lot more time on something that has no more insight (and cost even less to produce)
majormajor|2 years ago
So if demand stays the same, but the cost to produce it gets even lower, the ecosystem can support more players each pumping out their own version of all the spam sites. Stupid fake number example: Instead of spending $1000 a month to push out 100 articles, you spend $200 a month to push out 100 articles, and now only need to bring in $300 instead of $1100. So now there's another $800 worth of views that others could capture without putting you in the red, while letting them be in the green. So we may just end up with even more blogspam pushing the unique sources down the results list in Google. It's not likely to be meaningfully better based on my experience with "naive" prompting (the people creating these articles aren't going to be tuning their prompts to get good stuff per topic, they'll just get the quick and dirty stuff).
But I don't think that's where it's gonna get fun.
You know where there could be a lot of demand for higher-quality, harder-to-spot spam? Corporate product marketing and politics. Astroturfing in far more writing styles for far less money. Individuals running more fake accounts on Reddit, HN, etc, pushing more "unique"-but-repetitive copies of their views, while sprinkling in a fair bit of on topic info on a wider variety of other topics.
barbariangrunge|2 years ago
kristianp|2 years ago
paulgb|2 years ago
Hah, yep. I used to build tools to detect it at an adtech startup. A common approach at the time was to take someone else's text and use naive thesaurus replacement so that it would be just barely comprehensible but statistically look like english. So “You can catch the mouse” might become “you jar trap the rodent”. Glad to see how far technology has progressed! /s
duskwuff|2 years ago
There was some research a few years ago [2] into just how widespread this issue was in scientific publishing. The situation has likely only gotten worse with the introduction of higher-quality text generation LLMs.
[1]: https://ieeexplore.ieee.org/document/8597261
[2]: https://arxiv.org/abs/2107.06751
TechBro8615|2 years ago
seanp2k2|2 years ago
bko|2 years ago
transcriptase|2 years ago
visarga|2 years ago
Therefore, some form of reputation system remains necessary, like those used for scientific publications. I predict websites that provide a trusted reputation system will have a lot to gain in the future. Github stars, upvotes, retweets, citation count, or just good moderation - they will be essential in solving the spam/hallucination problem.
rchaud|2 years ago
We haven't even solved the email spam problem, and that started 30 years ago! We have simply accepted that the largest players in the business (MS, Gmail, Yahoo) will decide on our behalf which emails we will actually see. If you want to start your own email service, fine, just keep in mind that the big players might think you're a spammer.
Kim_Bruning|2 years ago
Mission accomplished?
seer|2 years ago
Totally off topic, but I just think that the producers of shows like black mirror should just hire the guy and he’ll come up with plots for really disturbing things not the mellow obvious crap they try to push now…
rdtsc|2 years ago
thih9|2 years ago
It does sound like an improvement short term, but long term I wish there was another option.
hot_gril|2 years ago
klntsky|2 years ago
Maybe even to an extent such that they will be actually useful
kristianp|2 years ago