Wait a minute - the attackers were using the API to ask Claude for ways to run a cybercampaign, and it was only defeated because Anthropic was able to detect the malicious queries? What would have happened if they were using an open-source model running locally? Or a secret model built by the Chinese government?I just updated by P(Doom) by a significant margin.
CGamesPlay|3 months ago
In all likelihood, the exact same thing that is actually happening right now in this reality.
That said, local models specifically are perhaps more difficult to install given their huge storage and compute requirements.
alganet|3 months ago
Local models are a different thing than those cloud-based assistants and APIs.
lmm|3 months ago
Not necessarily. Oracle has made billions selling a database that's less good than plain open-source ones, for example.
jimbohn|3 months ago
pixl97|3 months ago
Governments of course will have specially trained models on their corpus of unpublished hacks to be better at attacking than public models will.