top | item 44428487 (no title) matthewolfe | 8 months ago To echo the other replies, the tokenizer is definitely not the bottleneck. It just happens to be the first step in inference, so it's what I did first. discuss order hn newest No comments yet.
No comments yet.