top | item 47215428

(no title)

AndrewAndrewsen | 11 hours ago

Awesome project! I recently ran a (semi-)crowdsourced quality benchmarking for models ≤20b

How do you benchmark them? This would be awesome to implement at the page as well. I will link to this project at https://mlemarena.top/

discuss

order

No comments yet.