top | item 36544337

(no title)

RoyGBivCap | 2 years ago

"Several hundred organizations (maybe more) were scraping Twitter data extremely aggressively, to the point where it was affecting the real user experience.

What should we do to stop that? I’m open to ideas."

https://twitter.com/elonmusk/status/1674898695534309378

"1. Scraping is already disallowed by T&C.

2. The scraping orgs dgaf & mask their IPs through proxy servers or through orgs that appear legit. For example, a recent massive scraping operation originating from Oracle IP addresses was just using their servers as a laundromat.

3. We absolutely will take legal action against those who stole our data & look forward seeing them in court, which is (optimistically) 2 to 3 years from now."

https://twitter.com/elonmusk/status/1674898695534309378

discuss

order

agnosticmantis|2 years ago

> 3. We absolutely will take legal action against those who stole our data…

What does “our” refer to here? Does Twitter (i.e. musk) own the data in any sense? Or does he mean it as “we the people’s data”?

Very off-putting to read that sentence. Obviously he’s trying to monetize the user generated data in this LLM rush as other avenues to monetizations have flopped.

Jackson__|2 years ago

This also really sounds like he's trying to pretend his data is some kind of rare commodity, when the reality is that it's bottom of the barrel trash as far as text data for LLMs goes.

unethical_ban|2 years ago

I imagine part of the terms of using Twitter give the corporation ownership of comments. As is their right.

RoyGBivCap|2 years ago

I can't speak for him, just relaying the information.

But I'm happy to speculate: Organizations violated the twitter TOS by scraping, and he's going to sue the organizations for it.

GolfPopper|2 years ago

If he wants to be taken seriously, perhaps Mr. Musk can post the data somewhere others can read it? Maybe a Mastadon server?

omoikane|2 years ago

I thought the typical response would be rate limit plus captcha.

o1y32|2 years ago

Exactly. This is a (mostly) solved problem - if LinkedIn can do it without completely locking down the website, Twitter can as well