top | item 44834358 (no title) rullelito | 6 months ago Without knowing too much about ML training, generated output from the own model must be much easier to understand since it generates data that is more likely to be similar to the training set? Is this correct? discuss order hn newest jondwillis|6 months ago I don’t think so. The training data, or some other filter applied to the output tokens, is resulting in each model indicating that it is the best.The self-preference is almost certainly coming from post-processing, or more likely because the model name is inserted into the system prompt.
jondwillis|6 months ago I don’t think so. The training data, or some other filter applied to the output tokens, is resulting in each model indicating that it is the best.The self-preference is almost certainly coming from post-processing, or more likely because the model name is inserted into the system prompt.
jondwillis|6 months ago
The self-preference is almost certainly coming from post-processing, or more likely because the model name is inserted into the system prompt.