arugulum | 2 years ago

But what would they be calling out?

If industry groups want to do a training run using the configuration of a well-performing model, I don't see anything wrong with that. Now, if they were to claim that what they are doing is somehow "optimal", then there would be something to criticize.

swyx | 2 years ago

Poor choice of words. I probably meant sketching out the curves and doing ablation studies in a comprehensive way, like the Chinchilla paper did.

arugulum | 2 years ago

Makes sense! But expensive...