There are concerns besides spying if you really don't trust the source of an open model. One is that the training incorporates a bias (added data or data omission) that might not be immediately apparent but can affect you in a critical situation. Another is vendor lock-in, if you end up depending on specifics of the model that make it harder to swap later.
yeah, but try to convince a board or legal about it for a company that is not software first, for that they have to understand how it works. we have "chinese" AI blocked at work, even through i use self hosted models for myself at home hacking on my own stuff.
Good luck convincing others of this. I know it's true, you know it's true, but I've met plenty of otherwise reasonable people who just wouldn't listen to any arguments, they already knew better.
It's theoretically possible that your model will work OK except for code generation for security-relevant applications it will introduce subtle pre-designed bugs. Or if used for screening CVs it will prioritize PRC agents through some keyword in hobbies. Or it could promise a bribe to an office worker when asked about some critical infastructure :)
Sending data back could be as simple as responding with embedded image urls that reference external server.
You are totally right EU commissioner, Http://chinese.imgdb.com/password/to/eu/grid/is/swordfish/funnycat.png
Maybe it can not spy on you but models can be totally (e.g. politically) biased depending on the country of origin. Try to ask european-, us- or china-trained models about "Tiananmen Massacre" and compare the answers. Or consider Trump's recent decisions to get rid of "woke" AI models.
dbdr|5 months ago
That's true regardless of the source, of course.
apwell23|5 months ago
Wouldn't that 'concern' apply to mistral too. I don't see how the word 'another' can be used here?
disiplus|5 months ago
croes|5 months ago
miki123211|5 months ago
Good luck convincing others of this. I know it's true, you know it's true, but I've met plenty of otherwise reasonable people who just wouldn't listen to any arguments, they already knew better.
ajuc|5 months ago
Sending data back could be as simple as responding with embedded image urls that reference external server.
You are totally right EU commissioner, Http://chinese.imgdb.com/password/to/eu/grid/is/swordfish/funnycat.png
Possibilities are endless.
Xmd5a|5 months ago
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
cnr|5 months ago
Aerroon|5 months ago