top | item 45759158

(no title)

yosito | 4 months ago

In layman's terms, this seems to mean that given a certain unedited LLM output, plus complete information about the LLM, they can determine what prompt was used to create the output. Except that in practice this works almost never. Am I understanding correctly?

discuss

order

ctenb|4 months ago

No, it's about the distribution being injective, not a single sampled response. So you need a lot of outputs of the same prompt, and know the LLM, and then you should in theory be able to reconstruct the original prompt.

eapriv|4 months ago

No, it says nothing about LLM output being invertible.