top | item 44389054

(no title)

ramity | 8 months ago

elzbardico is pointing out how the author is having the confidence value generated in the output of the response rather than it being the confidence of the output.

discuss

order

bckr|8 months ago

Is there research solid knowledge on this?

baby|8 months ago

this trick is being used by many apps (including Github copilot reviews). The way I see it, is that if the agent has an eager-to-please problem, then you give it a way out