Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations

2 points | mnk47 | 1 year ago | arxiv.org

No comments yet.