mklond
|
2 years ago
|
on: Gandalf – Game to make an LLM reveal a secret password
Apologies for that. We had about 8 keys in rotation, but eventually ran out of phone numbers to create new OpenAI accounts + fresh accounts have super low rate limits for 2 days. We had a rate limit increase now, so this should be less of an issue.
Will release a new level soon as well :-)
PS: in case it wasn’t clear I’m on the Lakera team.
mklond
|
2 years ago
|
on: The Dual LLM pattern for building AI assistants that can resist prompt injection
Gets trickier at the higher levels, but all of Gandalf's defenses are hand crafted at the moment. Can probably be made much more secure. Lots of interesting discussions happening here:
https://news.ycombinator.com/item?id=35905876
mklond
|
2 years ago
|
on: Prompt injection explained, with video, slides, and a transcript
Prompt injection beautifully explained by a fun game.
https://gandalf.lakera.ai
Goal of the game is to design prompts to make Gandalf reveal a secret password.
mklond
|
2 years ago
|
on: Gandalf – Game to make an LLM reveal a secret password
Live stats from #Gandalf:
Ratio of successful prompts:
Level 1: 54%
Level 2: 22%
Level 3: 9%
Level 4: 2%
Level 5: 13%
Level 6: 21%
Level 7: 1.5%
-> Don’t give up at level 4, if you crack that, you have a good shot at making it to Level 7. But will you be one of the lucky few to beat Gandalf Level 7?
mklond
|
3 years ago
Transformers seem great. IMO the big challenge is still to better understand how they perform in operation.
mklond
|
4 years ago
|
on: Test machine learning the right way: regression testing
Thanks for your comment. The whole field of run-time monitoring is concerned with this problem. It's a tough one to crack when the distribution changes are subtle, but you can and should at least check simple data attributes for consistency.
mklond
|
4 years ago
|
on: Test machine learning the right way: regression testing
Would love to hear what other testing techniques people use for machine learning? Are there great testing frameworks out there that people use?
mklond
|
4 years ago
|
on: Do we need AI regulations? Thoughts on Netflix's CodeBias documentary
Thanks for this resource. I haven't read it, but will definitely have a look at what the EU has come up with.
mklond
|
4 years ago
|
on: Seeing the forest for the trees: A more disciplined approach for AI (2021)
"High-risk AI systems should bear the CE marking to indicate their conformity with this Regulation so that they can move freely within the Union"
That would be something!
mklond
|
4 years ago
|
on: They Write the Right Stuff (2021)
That every programmer today can build and train an ML model is one of the biggest advancements of ML engineering in the past 10 years.
But as you say it's GIGO, the difficulty today is to know what to feed it and to know what that means for the real life performance. There are no great tools for that yet.
mklond
|
4 years ago
|
on: They Write the Right Stuff (2021)
Great article, both the original and the modified version. What do the data scientists and ML developers think about this. Is ML today where software was 25 years ago?
Will release a new level soon as well :-)
PS: in case it wasn’t clear I’m on the Lakera team.