bfors | 11 months ago

Perhaps they already evaluated their LLM judge model (with another LLM).
