top | item 40002399

(no title)

tomfreemax | 1 year ago

I have to say I don't really get the website either. If the author is against scraper why not serve massive dummy content that it bloats their storage? Why all this linking? Maybe it's used to build (fake) page rank credibility and sometimes a link to one of the content farm pages is referenced on other pages, so these get boosted then?

discuss

order

relaxing|1 year ago

Presumably he would be paying for egress of those massive files?

ImPostingOnHN|1 year ago

So render it clientside and hope the crawler understands javascript?

Maybe run your own training in javascript, too, and use OpenAI's crawlers' compute for it.