2 years ago|discuss
user: weichiang
181 karma | created 3 years ago
recent submissions
2 years ago|discuss
2 years ago|discuss
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
(twitter.com)
20 pts|2 years ago|discuss
1 pts|2 years ago|discuss
2 years ago|discuss
Chatbot Arena: a crowd-sourced LLM leaderboard
(twitter.com)
1 pts|2 years ago|1 comment
2 years ago|discuss
2 years ago|discuss
126 pts|2 years ago|84 comments
2 years ago|discuss
271 pts|2 years ago|139 comments
Who's GPT-4's favorite? Battles between state-of-the-art chatbots.
(vicuna.lmsys.org)
6 pts|2 years ago|4 comments
2 years ago|discuss