palewire | 2 years ago
OpenAI, the artificial intelligence company behind ChatGPT, has suggested it will not train future versions of the model on sites that opt out of its GPTBot crawler via the robots.txt convention.
Our archiving system gathers each news organization’s robots.txt file twice per day. This page automatically updates with the latest results.
The list of sites we track is a best-effort attempt to cover a broad cross-section of news publishing.
That said, the sample is not comprehensive, and it focuses primarily on the English-language market.
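As a rough illustration of what opting out through robots.txt looks like in practice, here is a minimal sketch using Python's standard-library `urllib.robotparser`. The helper name `blocks_gptbot` and the two sample files are hypothetical, made up for this example; this is not the archive's actual code, only one way such a check could be written.

```python
from urllib.robotparser import RobotFileParser


def blocks_gptbot(robots_txt: str, url: str = "https://example.com/") -> bool:
    """Return True if this robots.txt text disallows GPTBot from fetching url.

    Hypothetical helper for illustration; example.com is a placeholder.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch("GPTBot", url)


# Made-up sample: a site that opts out of GPTBot entirely.
SAMPLE_OPT_OUT = """\
User-agent: GPTBot
Disallow: /
"""

# Made-up sample: a site with no GPTBot-specific rule.
SAMPLE_OPEN = """\
User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    print(blocks_gptbot(SAMPLE_OPT_OUT))
    print(blocks_gptbot(SAMPLE_OPEN))
```

To check a live site instead of a string, `RobotFileParser` also offers `set_url()` followed by `read()`, which fetches the file over the network before `can_fetch()` is called.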