(no title)
michaelcampbell | 11 days ago
Or is this file meant to be "read" by an LLM long after the entire site has been scraped?
michaelcampbell | 11 days ago
Or is this file meant to be "read" by an LLM long after the entire site has been scraped?
hamdingers|11 days ago
I've done honeypot tests with links in html comments, links in javascript comments, routes that only appear in robots.txt, etc. All of them get hit.
efreak|11 days ago
dumbfounder|11 days ago
reconnecting|11 days ago
I assume that there are data brokers, or AI companies themselves, that are constantly scraping the entire internet through non-AI crawlers and then processing data in some way to use it in the learning process. But even through this process, there are no significant requests for LLMs.txt to consider that someone actually uses it.
olivia-banks|11 days ago
giancarlostoro|11 days ago