I'm part of the team, and we used LLM agents extensively for smart bug finding and patching. I'm happy to discuss some insights, and share all of the approaches after grand final :)
AIxCC is an AI Cyber Challenge launched by DARPA and ARPA-H.
Notably, a zero-day vulnerability in SQLite3 was discovered and patched during the AIxCC semifinals, demonstrating the potential of LLM-based approaches in bug finding.
Notably, an undiscovered trivial NULL pointer dereference in SQLite3's SQL parser was discovered and patched. But yeah, it makes very good marketing material.
this is really impressive work. coverage guided and especially directed fuzing can be extremely difficult. its mentioned fuzzing is not a dumb technique. I think the classical idea is kind of dumb, in the sense of 'dumb fuzzers' but these days there is tons of intelligence built around it now aand poured into it, but i've always thought its now beyond the classic idea of fuzz testing. i had colleagues who poured their soul into trying to use git commit info etc. to try and help find potentially bad code paths and then coverage guided fuzzing trying to get in there. I really like the little note at the bottom about this. adding such layers kind of does make it lean towards machine learning nowadays, and id think perhaps fuzzing is not the right term anymore. i dont think many people are actually still simply generating random inputs and trying to crash programs like that.
this is really exciting new progress around this type of field guys. well done! cant wait to see what new tools and techniques will be yielded from all of this research.
Will you guys be open to implementing something around libafl++ perhaps? i remember we worked with that extensively. As a lot of shops use that already it might be cool to look at integration into such tools or would you think this deviates so far it'll amount to a new kind of tool entirely? Also, the work on datasets might be really valuable to other researchers. there was a mention of wasted work but labeled sets of data around cve, bug and patch commits can help a lot of folks if theres new data in there.
this kind of makes me miss having my head in this space :D cool stuff and massive congrats on being finalists. thanks for the extensive writeup!
I heard that the AIxCC booth prepared the same challenges for the audience to solve manually, but I didn’t check the details.
I believe there will be even more cool stuff in next year’s grand final. If you want to get a sense of what to expect, check out the DARPA CGC from 2016. :)
[+] [-] hqzhao|1 year ago|reply
[+] [-] wslh|1 year ago|reply
[+] [-] adragos|1 year ago|reply
Have you tested your CRS on weekend CTFs? I’m curious how well it’d be able to perform compared to other teams
[+] [-] doctorpangloss|1 year ago|reply
[+] [-] simonw|1 year ago|reply
[+] [-] garlic_chives|1 year ago|reply
Notably, a zero-day vulnerability in SQLite3 was discovered and patched during the AIxCC semifinals, demonstrating the potential of LLM-based approaches in bug finding.
[+] [-] rfoo|1 year ago|reply
[+] [-] hypeatei|1 year ago|reply
[+] [-] unknown|1 year ago|reply
[deleted]
[+] [-] sim7c00|1 year ago|reply
this is really exciting new progress around this type of field guys. well done! cant wait to see what new tools and techniques will be yielded from all of this research.
Will you guys be open to implementing something around libafl++ perhaps? i remember we worked with that extensively. As a lot of shops use that already it might be cool to look at integration into such tools or would you think this deviates so far it'll amount to a new kind of tool entirely? Also, the work on datasets might be really valuable to other researchers. there was a mention of wasted work but labeled sets of data around cve, bug and patch commits can help a lot of folks if theres new data in there.
this kind of makes me miss having my head in this space :D cool stuff and massive congrats on being finalists. thanks for the extensive writeup!
[+] [-] wslh|1 year ago|reply
[1] https://xbow.com/
[2] https://www.sequoiacap.com/article/partnering-with-xbow-the-...
[+] [-] fredmerc|1 year ago|reply
[deleted]
[+] [-] unknown|1 year ago|reply
[deleted]
[+] [-] deeznuttynutz|1 year ago|reply
[+] [-] rockskon|1 year ago|reply
[+] [-] hqzhao|1 year ago|reply
I believe there will be even more cool stuff in next year’s grand final. If you want to get a sense of what to expect, check out the DARPA CGC from 2016. :)
[+] [-] unknown|1 year ago|reply
[deleted]
[+] [-] m3kw9|1 year ago|reply
[deleted]
[+] [-] unknown|1 year ago|reply
[deleted]