(no title)
BoiledCabbage | 1 month ago
And they didn't even bother to test the most important thing. Were the LLM evaluations even accurate! Have graders manually evaluate them and see if the LLMs were even close or were wildly off.
This is clearly someone who had a conclusion to promote regardless of what the data was going to show.
wanderingbit|1 month ago
This is not true; the professor and the TAs graded every student submission. See this paragraph from the article:
(Just in case you are wondering, I graded all exams myself and I asked the TA to also grade the exams; we mostly agreed with the LLM grades, and I aligned mostly with the softie Gemini. However, when examining the cases when my grades disagreed with the council, I found that the council was more consistent across students and I often thought that the council graded more strictly but more fairly.)
leoc|1 month ago
Hnrobert42|1 month ago
https://elevenlabs.io/app/talk-to?agent_id=agent_8101k9d1pq4...
plagiarist|1 month ago
knallfrosch|1 month ago
bsenftner|1 month ago
pooper|1 month ago
https://i.imgur.com/EshEhls.png
When someone at that level pretends to not understand it, there is no way to mince words.
This is malice.
bjt|1 month ago
chairmansteve|1 month ago
Having said that, LLMs can be good tutors if used correctly.
skybrian|1 month ago