top | item 46597108 (no title) lanstin | 1 month ago We need to train LLMs in a situation like a semi-trustworthy older sibling trying to get you to fall for tricks. discuss order hn newest TeMPOraL|1 month ago That's what we are doing, with the Internet playing the role of the sibling. Every successful attack the vendors learn about becomes an example to train next iteration of models to resist.
TeMPOraL|1 month ago That's what we are doing, with the Internet playing the role of the sibling. Every successful attack the vendors learn about becomes an example to train next iteration of models to resist.
TeMPOraL|1 month ago