hangsi|1 year ago
The common method for choosing an LLM's next output token is to sample from a Boltzmann distribution. If you have seen the term "temperature" in the context of language models, it is a direct link to the statistical mechanics of gases.
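For anyone who wants to see it concretely, here is a minimal sketch of temperature sampling, assuming the model hands back a plain list of logits. The function names `boltzmann_probs` and `sample_token` are my own, not from any library. If you read each logit as a negative energy, then p_i is proportional to exp(logit_i / T), which is the Boltzmann form with T playing the role of k_B times the physical temperature.

```python
import math
import random


def boltzmann_probs(logits, temperature=1.0):
    """Return p_i proportional to exp(logit_i / temperature).

    Hypothetical helper for illustration; logits are treated as negative energies.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max so exp() cannot overflow
    weights = [math.exp(s - m) for s in scaled]
    total = sum(weights)  # the partition function, up to the constant factor exp(m)
    return [w / total for w in weights]


def sample_token(logits, temperature=1.0, rng=random):
    """Draw one token index from the Boltzmann distribution over the logits."""
    probs = boltzmann_probs(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]


if __name__ == "__main__":
    logits = [2.0, 1.0, 0.0]  # made-up values, not from a real model
    for t in (0.1, 1.0, 100.0):
        print(t, boltzmann_probs(logits, t))
    print(sample_token(logits, temperature=0.7))
```

Low temperature concentrates nearly all the probability on the highest logit (the "ground state"), and high temperature flattens the distribution toward uniform, same as in the physical case.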
whimsicalism|1 year ago