NIL8 | 3 years ago Fascinating. Now, I want to try it before the humans put a stop to it :)
linuxdeveloper|3 years ago I failed to replicate the attack later in the evening in a "new" conversation. It does appear to me the model is learning between conversations, even without human input or RLHF.