top | item 39361684 (no title) keketi | 2 years ago Every output token costs GPU time and thereby money. They could have tuned the model to be less verbose in this way. discuss order hn newest No comments yet.
No comments yet.