I like the idea and even considered contributing to the list, but this stopped me:
> NAQ (Never Asked Questions)
> My website is on your list!
> Cry about it.
That's quite a suspicious attitude. Clearly the maintainer believes he is infallible. I understand the emotions behind this, but this is not how a public blacklist should be maintained.
Yuuup. My personal website has been inaccessible to a few friends, they thought my server was down. It turned out they had some blocklist (not related to AI) installed on their PiHole, and for whatever reason my website was on that list. It is, in fact, to this day, because my request to unblock it went completely unanswered. I still don't know why the website is on the list.
Probably because there's about the same chance of them being innocent as the "Help I was wrongfully banned by VAC :(((" posts in the Counterstrike community.
I would add that with this attitude and how new this initiative is, there's very little chance it will still be updated 5 years from now. Really this sort of thing needs to come from Easylist or similar, who have a track record of maintaining these for years.
Also seems a bit hypocritical given the screed about how such a list is necessary because the AI content might output hallucinations or damaging content without review.
But if it’s the author’s blocklist that is wrong, unverified, and causing harm to others? Cry about it.
The broad list seems to just be a hater list. It's not trying to cover cases of deception (passing off AI material as if it's something else), as it includes sites which are very open about what kind of content is on there.
So there is a spreadsheet of websites. That is very interesting. There was an article here sometime ago about a media group who have so many super SEOd websites. They all have common footer text. I searched and added as many as I could find in uBlacklist. I have a gist listing them and how I searched for them. You might find that useful.
Ublock Origin also already has an “AI widget” blocklist you can enable. Literally the only extension that keeps me on Firefox because of how useless it is on Chromium.
Assuming you're talking about Manifest V2 deprecation: you can disable it in Chromium-based browsers by launching with `--disable-features=ExtensionManifestV2Unsupported,ExtensionManifestV2Disabled`, and enjoy uBlock once again. See also the github discussion: https://github.com/uBlockOrigin/uBlock-issues/discussions/29...
Personally I find that I prefer badly written english or auto-translated stuff written in languages foreign to me over ai generated or even just ai polished works I've seen. There is just so much more character, depth and variance there vs ultra ai generic or slop text.
That being said this project seems focused on content farms not people who just need a little help writing so this whole conversation is a bit of a side tangent.
I use Grammarly at work (it's mostly to make sure our brand guidelines are kept) and I don't find that it (defaultly) corrects too far into the ai slop territory. It's mostly just making sure your sentence is correct.
Op is going after AI slop bot farms like android authority
Glad we're moving in this direction, I've also got a tool that I use to determine if writing is AI using common tropes and reconstruct the OG prompt from it: https://tropes.fyi/aidr
That's a curious one, Twitter is worthless anyway. Before AI bots proliferated, the change to rank paid accounts high in replies turned it into a de facto entry level $8/month advertising tier.
Love this, I wish there were more and broader categories of sites one could block. You can always temporarily allow sites.
In the enterprise space, there are URL reputation providers. They categorize sites based on different criteria, and network administrators block or warn users based on that information.
In my humble opinion, there needs to be a crowdsourced fund (or ideally governments would take this seriously and fund it on behalf of people) for enabling technologies that allow user friendly internet experiences. Browsers, frameworks, vpn providers, site-reputation, deceptive content, dns-providers, email providers,trusted certificate authorities(no,google and microsoft shouldn't get to police that), nation-state or corporate affiliations,etc... You shouldn't need to setup a pi-hole.
Imagine a $1B/yr non-profit fund for this stuff. if 10M people paid $10/mo that's $1.2B/yr. Proton has $97M revenue in 2024 and 100M total accounts (I don't know how many pay but the spread is roughly $1/user). I really think now is the time to talk about this when so many are wary of US tech giants and looking for other opportunities.
I'm the weirdo who just closes websites with too many ads, and just mostly powers through the ads. If you have a sane setup for ads I will use your website. I'm tired of the years of adblock drama. Every time I come to these threads its completely different names for adblock plugins, it's like a rat race.
I get this. uBO on Firefox has "just worked" for me for a long time with 0 configuration (I don't mess with the blacklists), but on phones it's a different game, and on Chrome, and before uBO there was other drama.
I would rather have a whitelist that adds a nice tag at the end of the link, indicating that overall it has high quality content. This also forces you to periodically check the sites you've whitelisted
Meta question: do you guys feel the adblockers will maybe not be that important in the future? As for myself, I ended up to use just a few websites, but those are reputable and I don't mind a few ads they provide. The only adblock which is still very much needed is one for Youtube.
I used to run pihole on a Pi and now I directly run unbound, still on a Pi. The difference on a great many sites is night and day: you simply get way fewer ads. And that's just by using a DNS blocklist.
Occasionally I'll get one site that refuses to load because I've got an "adblocker" but most sites do work fine, just with way fewer ads.
flip it, and build green(organic) lists
perhaps work towards having sites than dont just, not use AI, but never talk about it
it's not just AI, search is a scam, no mojo in the world can extract the contact info for the business next door and the mountains of porncoin, scamulous garbage and hate news
taking up a full 50% of whats left, does in fact make a determined effort to greenwall a section of the web something to consider
Admirable idea and execution…but it does apply opposing evolutionary/economic pressure for AI-slop to become less detectable over time. AI will learn and adapt.
Metaphorically speaking, it’s the Borg we’re dealing with, not the Klingons. All Janeway did was slow the Borg’s progress.
Cory Doctorow wrote a story ~20 years ago about how the first sentient machines would be spam bots because their job is to pass as human, and anti-spam systems provide competitive evolutionary pressure.
I feel like this is a bit of a sinking ship. I suppose if you want to avoid known sources of slop then this works … but beyond that it’s a bit of a lost cause. It’s like sports betting — once it’s there then there’s no saying who is (ab)using it.
It's not perfect, but in time my search results have gone from the first several pages being mostly garbage to mostly all good. Sure, new spam sites crop up every few days, but it's a quick block.
Why? He posts high-quality content that's interesting if you care about that field. It's not my cup of tea, but it's pretty far from what this list tries to block.
I take it you're talking about the user here with the nick simonw? I find his comments on HN interesting and balanced: don't know why you think he should be filtered out.
I do use "blocklist" on new project and name my main "trunk" and not "master" but I'll both a) defend other's rights to use terms like blocklists and master and b) call out the virtue signalling ones who are trying to push a political agenda by trying to control thoughts (by attempting to control speech).
quiet35|9 days ago
> NAQ (Never Asked Questions)
> My website is on your list!
> Cry about it.
That's quite a suspicious attitude. Clearly the maintainer believes he is infallible. I understand the emotions behind this, but this is not how a public blacklist should be maintained.
TonyTrapp|9 days ago
Chris2048|9 days ago
Drupon|9 days ago
the_biot|9 days ago
DrammBA|9 days ago
> A personal list for uBlock Origin
wasmainiac|9 days ago
ycombinatrix|9 days ago
GaryBluto|9 days ago
[deleted]
well_ackshually|9 days ago
[deleted]
NeutralCrane|9 days ago
But if it’s the author’s blocklist that is wrong, unverified, and causing harm to others? Cry about it.
rdmuser|10 days ago
A nice alternative to this very broad anti ai list: https://github.com/laylavish/uBlockOrigin-HUGE-AI-Blocklist
Edit: Oh I should mention I found it through reddit and there is some good discussion there where they describe how they find stuff etc: https://www.reddit.com/r/uBlockOrigin/comments/1r9uo3j/autom...
Dwedit|10 days ago
smusamashah|9 days ago
Edit: https://gist.github.com/SMUsamaShah/6573b27441d99a0a0c792431...
xnx|10 days ago
tkel|9 days ago
throwatdem12311|9 days ago
stratos123|8 days ago
lifthrasiir|10 days ago
> All I hear is skill issue. Imagine needing an AI to write stuff.
Grammarly users (and underrepresented non-English speakers) would complain.
QuadmasterXLII|10 days ago
dangus|10 days ago
E.g., bought a domain that previously hosted AI content.
E.g., Whitehouse.com used to be a porn site, now it’s not.
rdmuser|10 days ago
That being said this project seems focused on content farms not people who just need a little help writing so this whole conversation is a bit of a side tangent.
duskdozer|10 days ago
jofzar|10 days ago
Op is going after AI slop bot farms like android authority
rererereferred|10 days ago
amelius|9 days ago
papichulo2023|9 days ago
ossa-ma|9 days ago
mh-|9 days ago
https://tropes.fyi/aidr/b184cf3a
https://tropes.fyi/aidr/9b132f92
dimava|9 days ago
driverdan|9 days ago
add-sub-mul-div|9 days ago
srid|9 days ago
I ask becaue it considers @lilycoy__ (an obvious AI generated account, as quoted by Robin Hanson <https://x.com/robinhanson/status/2025332066552819782>) to be "100% human"
https://i.imgur.com/NQHVcdM.png
notepad0x90|9 days ago
In the enterprise space, there are URL reputation providers. They categorize sites based on different criteria, and network administrators block or warn users based on that information.
In my humble opinion, there needs to be a crowdsourced fund (or ideally governments would take this seriously and fund it on behalf of people) for enabling technologies that allow user friendly internet experiences. Browsers, frameworks, vpn providers, site-reputation, deceptive content, dns-providers, email providers,trusted certificate authorities(no,google and microsoft shouldn't get to police that), nation-state or corporate affiliations,etc... You shouldn't need to setup a pi-hole.
Imagine a $1B/yr non-profit fund for this stuff. if 10M people paid $10/mo that's $1.2B/yr. Proton has $97M revenue in 2024 and 100M total accounts (I don't know how many pay but the spread is roughly $1/user). I really think now is the time to talk about this when so many are wary of US tech giants and looking for other opportunities.
giancarlostoro|9 days ago
zadikian|8 days ago
dotancohen|9 days ago
ramon156|9 days ago
lkm0|9 days ago
dgares|9 days ago
[1] https://github.com/alvi-se/ai-ublock-blacklist/commit/f6ee8d...
semiinfinitely|9 days ago
mixtureoftakes|9 days ago
greyman|9 days ago
diath|9 days ago
TacticalCoder|9 days ago
Occasionally I'll get one site that refuses to load because I've got an "adblocker" but most sites do work fine, just with way fewer ads.
Grom_PE|9 days ago
xboxnolifes|9 days ago
metalman|10 days ago
rishabhaiover|9 days ago
firebot|10 days ago
afcool83|10 days ago
Metaphorically speaking, it’s the Borg we’re dealing with, not the Klingons. All Janeway did was slow the Borg’s progress.
mapontosevenths|10 days ago
He may not be too far off.
alansaber|9 days ago
smohare|9 days ago
[deleted]
jadar|9 days ago
duskdozer|7 days ago
Dwedit|10 days ago
harladsinsteden|10 days ago
Joel_Mckay|9 days ago
The bots and SEO spammers already fill sites with garbage =3
KomoD|8 days ago
dhayabaran|9 days ago
[deleted]
meindnoch|9 days ago
[deleted]
nicbou|9 days ago
TacticalCoder|9 days ago
eclipticplane|9 days ago
filldorns|9 days ago
charonn0|9 days ago
TacticalCoder|9 days ago
nosrepa|9 days ago