top | item 40997036

(no title)

ancientworldnow | 1 year ago

This was trained to be run at FP8 with no quality loss.

discuss

order

hislaziness|1 year ago

The model description on huggingface says - Model size - 12.2B params, Tensor type - BF16. Is the Tensor type different from the training param size?