WingNews logo WingNews
top | new | best | ask | show | jobs
top | item 44006459

(no title)

haffi112 | 9 months ago

(watching live) I'm wondering how it performs on the METR benchmark (https://metr.org/blog/2025-03-19-measuring-ai-ability-to-com...).

discuss

order

No comments yet.

powered by hn/api // news.ycombinator.com