ancientworldnow | 1 year ago
This was trained to be run at FP8 with no quality loss.
hislaziness | 1 year ago
The model description on Hugging Face lists Model size: 12.2B params and Tensor type: BF16. Is the tensor type different from the parameter format used in training?
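For context on the question, parameter count and tensor type are separate properties: the first says how many weights there are, the second says how many bits each stored weight takes. A rough sketch of the arithmetic, assuming only the 12.2B figure quoted above and the standard widths of BF16 (2 bytes) and FP8 (1 byte); the helper name `weight_size_gb` is hypothetical, and the estimate covers weights only, not activations or any runtime overhead:

```python
# Parameter count quoted in the comment above (12.2B params).
PARAMS = 12.2e9

# Storage width per weight for the two formats mentioned in the thread.
BYTES_PER_PARAM = {"BF16": 2, "FP8": 1}


def weight_size_gb(num_params: float, dtype: str) -> float:
    """Approximate size of the weights alone, in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weight_size_gb(PARAMS, dtype):.1f} GB")
```

Under these assumptions the same 12.2B parameters take about twice the space when stored as BF16 as they would in FP8, so a checkpoint published in BF16 does not by itself say anything about the precision the model can be run at.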