item 43551334

Ask HN: Why is Generative AI non-deterministic

2 points| michaeljx | 11 months ago

Given a trained LLM model with fixed weights, why is it that the same prompt yields different responses? Or is it the case that some type of RL takes place?

7 comments


bigyabai|11 months ago

Generative AI typically is deterministic; most inference software just includes a random seed to yield different results on each repeated entry.

minimaxir|11 months ago

That's not strictly correct. All LLMs output logits softmax'd into a probability distribution of the next token, and this distribution is indeed deterministic.

Most generative AI apps set a nonzero temperature, which scales the probability distribution. So if you have a distribution of 50%, 30%, 20% for three tokens and a temperature of 1, then you'd get up to 3 different outputs sampled at those exact probabilities, which iteratively cascade into completely different texts. The RNG of the probability selection can be controlled by a seed, but with distributed systems that is often not the case: I've only seen seeds returned for cases where the entire model is on a single system. Otherwise, just not using a seed is fine for sufficient randomness.

If the temperature is 0, it instead chooses the token with the highest probability, and done iteratively the final output will be the same. (This is not accounting for distributed-system weirdness.)
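A minimal sketch of the sampling behavior described above, assuming NumPy; the logit values and vocabulary size here are made up for illustration, not taken from any real model:

```python
import numpy as np

rng = np.random.default_rng(seed=42)  # fixing the seed makes the draws reproducible

logits = np.array([2.0, 1.5, 1.1])  # hypothetical next-token logits for a 3-token vocab

def sample_next_token(logits, temperature):
    if temperature == 0:
        return int(np.argmax(logits))      # greedy: always the highest-probability token
    scaled = logits / temperature          # temperature rescales the logits
    probs = np.exp(scaled - scaled.max())  # softmax (max-shifted for numerical stability)
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))  # stochastic draw from the distribution

# At temperature 0 the choice never varies; at temperature 1 it depends on the RNG state.
greedy = [sample_next_token(logits, 0) for _ in range(5)]
sampled = [sample_next_token(logits, 1.0) for _ in range(5)]
```

With no seed (or a seed chosen per request), repeated runs of the `sampled` path diverge token by token, which is the cascading effect described above.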

schoen|11 months ago

It's deliberately made nondeterministic, partly using something called softmax

https://en.wikipedia.org/wiki/Softmax_function

I'd say mainly in order to avoid boring its users.

minimaxir|11 months ago

Softmax just normalizes the logit outputs to be positive and sum to 1.0; it has no effect on determinism.
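To illustrate the point, a small sketch with made-up logit values: softmax is a pure function, so the same logits always produce the same distribution; the randomness enters only in the draw from that distribution.

```python
import math

def softmax(logits):
    # Exponentiate (max-shifted for stability) and normalize
    # so the outputs are positive and sum to 1.0.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]
a = softmax(logits)
b = softmax(logits)
# Same input, same output: softmax itself is deterministic.
```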