imenani | 10 months ago

They fix the temperature at T=0.6 for all k and all models, even though their own Figure 10 shows that the RL model benefits from higher temperatures. I would buy the overall claim much more if they had swept the temperature parameter for each k and each model, as was done in the Codex paper [1].

[1] https://arxiv.org/abs/2107.03374
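
To make the suggestion concrete, here is a rough sketch of what a per-k temperature sweep could look like. It uses the unbiased pass@k estimator from the Codex paper, 1 - C(n-c, k) / C(n, k), where n is the number of samples per problem and c is how many of them are correct. The function names, the data layout, and the toy counts are my own assumptions for illustration and are not taken from either paper.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem: 1 - C(n-c, k) / C(n, k)."""
    if n - c < k:
        # Fewer than k incorrect samples, so any k-subset contains a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def best_temperature_per_k(correct_counts, n, ks):
    """Pick, for each k, the temperature with the highest mean pass@k.

    correct_counts maps temperature -> list of correct-sample counts,
    one entry per problem, each out of n samples (hypothetical layout).
    Returns {k: (best_temperature, mean_pass_at_k)}.
    """
    best = {}
    for k in ks:
        scores = {
            t: sum(pass_at_k(n, c, k) for c in counts) / len(counts)
            for t, counts in correct_counts.items()
        }
        t_star = max(scores, key=scores.get)
        best[k] = (t_star, scores[t_star])
    return best


# Made-up toy counts (two problems, n=10 samples each), NOT real results.
toy_counts = {0.2: [5, 0], 1.0: [2, 2]}
best = best_temperature_per_k(toy_counts, n=10, ks=[1, 5])
for k, (t, score) in best.items():
    print(f"k={k}: best T={t}, pass@k={score:.3f}")
```

The point is that the comparison at each k would use the best temperature for that model at that k, rather than a single fixed T=0.6 for everything.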