top | item 44682941

(no title)

Squeeze2664 | 7 months ago

How do you determine the importance of a layer in this case?

discuss

order

kkzz99|7 months ago

Afaik they have a test bench that they use and take the activation data from that.

danielhanchen|7 months ago

Yes we have around 1 to 3 million tokens of high quality self verified data that we use to calibrate models!