crazypython | 2 years ago

The GPT-3.0 "davinci-instruct-beta" models have been returning non-deterministic logprobs since at least early 2021. This is speculation. CUDA itself often has nondeterminism bugs.
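To illustrate one way this kind of nondeterminism can creep in (a minimal sketch of the general floating-point effect, not a claim about what actually happens inside OpenAI's stack or any specific CUDA kernel): floating-point addition is not associative, so a parallel reduction that sums the same values in a different order can land on a slightly different result, and that difference can carry through to logprobs. The values below are arbitrary.

```python
# Floating-point addition is not associative: grouping changes the result.
# A GPU reduction that accumulates in a different order from run to run
# can therefore produce slightly different sums for identical inputs.
a, b, c = 0.1, 0.2, 0.3

left = (a + b) + c   # one accumulation order
right = a + (b + c)  # another accumulation order

order_matters = left != right
print(left, right, order_matters)
```

The gap here is on the order of 1e-16, but in a large model many such sums feed into each other, so the final logprobs need not match bit for bit.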

text-davinci-001 and text-davinci-002 were trained through FeedMe and SFT, while text-davinci-003 was trained with RLHF. The models themselves have more variance at high temperature.

cubefox | 2 years ago

What about the foundation models, i.e. davinci and code-davinci-002?