top | item 38714469

(no title)

Footnote7341 | 2 years ago

Such a vanishingly small percentage of the images its not even worth calculating. Of course search engines also contain these links, LAION-5B only contains links not images as well....

protect the kids!!! or something. Can we do a scandal about how many 'extremist' images are in the data-set next too? anti-vaxer, climate denier, nazi, religious extremist propaganda, scientific misinformation. Maybe we're all safer off using corporate models with closed data sets so no one gets any of the wrong ideas.

discuss

order

WheatMillington|2 years ago

For the children who have been victimised, I doubt the small percentage provides any comfort.

ipaddr|2 years ago

The victims have already been victimized many years ago. Finding this content in a training set doesn't re-victimize them.

nulld3v|2 years ago

Wtf, if you can get rid of the images why wouldn't you? Clearly it's not an exorbitant amount of work as shown in the paper, they use very conventional techniques.

colechristensen|2 years ago

CSAM does not fall under “freedom of expression”, everything else you list does. It’s not a slippery slope.

Resist anyone’s attempt to make everything else you list illegal to express or possess.