gyudin | 9 months ago Super weird benchmarks
avereveard|9 months ago From what I gather, it's fine-tuned to use OpenHands specifically, so it shows value on those benchmarks that target a whole system as a black box (i.e. agent + LLM) more than on ones that directly target the LLM's inputs/outputs.
amarcheschi|9 months ago Yup, the first comment says this: https://www.reddit.com/r/LocalLLaMA/comments/1kryybf/mistral...