(no title)
Lucent
|
1 year ago
This is an incredible relief and should be the final nail in the coffin for safety/alignment/shoggoth arguments. It turns out features are completely scrutable, and when modified, we don't see chaotic, schizo non-sequiturs, but a coherent, predictable, globally-consistent shift proving models are operating in a fundamentally understandable way.
No comments yet.