item 41916770 (no title)
peakji | 1 year ago
It is an LLM fine-tuned using a new type of dataset and RL reward. It's good at reasoning, but I would not recommend it as a replacement for Llama on general tasks.