

-_- | 5 months ago

There needs to be some way to assess performance on the task automatically. That could be a Python function, another LLM acting as a judge, or a combination of the two.
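For what it's worth, here is a rough sketch of what the "combination" might look like. Everything in it is an assumption for illustration: the names `exact_match`, `combined_score`, `stub_judge`, and `judge_weight` are made up, and `stub_judge` is a plain function standing in for a real LLM call, which would need whatever client you actually use.

```python
from typing import Callable, Optional


def exact_match(output: str, expected: str) -> float:
    """Programmatic check: 1.0 if the normalized strings match, else 0.0."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def combined_score(
    output: str,
    expected: str,
    judge: Optional[Callable[[str, str], float]] = None,
    judge_weight: float = 0.5,
) -> float:
    """Blend the programmatic check with an optional judge score.

    `judge` is any callable returning a score; it is clamped to [0, 1]
    so a misbehaving judge cannot push the result out of range.
    """
    rule = exact_match(output, expected)
    if judge is None:
        return rule
    judged = min(1.0, max(0.0, judge(output, expected)))
    return (1 - judge_weight) * rule + judge_weight * judged


def stub_judge(output: str, expected: str) -> float:
    """Hypothetical stand-in for an LLM judge (no model is called here)."""
    return 1.0 if expected.lower() in output.lower() else 0.0


if __name__ == "__main__":
    print(combined_score("Paris", "paris"))
    print(combined_score("The answer is Paris", "Paris", judge=stub_judge))
```

The point of the split is that the Python check stays cheap and deterministic, while the judge only has to cover cases the rule can't express, such as a correct answer wrapped in extra wording.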
