(no title)
HALtheWise | 1 year ago
- Not an "exam" composed of single-correct-answer closed-form questions with objective answers
- Not consisting of questions that humans/humanity is capable of answering.
For example, a future evaluation for an LLM could consist of playing chess really well or solving the Riemann Hypothesis or curing some disease, but those aren't tasks you would ever put on an exam for a student.
krackers|1 year ago