top | item 42462216

(no title)

mofeien | 1 year ago

History contains countless examples for the fact that "in order to complete an important task or goal it is useful to exist". It also seems not too difficult to deduce logically. So even if Yudkowsky's fanfiction were excluded from the training data, the model would learn this.

Also, what's the difference between pretending to escape the matrix and escaping the matrix in case of a language model?

discuss

order

echelon|1 year ago

> Also, what's the difference between pretending to escape the matrix and escaping the matrix in case of a language model?

It is neither pretending nor actually escaping.