top | item 46420542

(no title)

tessierashpool9 | 2 months ago

I mean, isn't that a little ridiculous? Aren't those language models already solving complicated exam questions and mathematical problems?

discuss

order

geon|2 months ago

According to the creators, the models are on a phd level of intelligence, but they can’t get the simplest thing right.

tessierashpool9|2 months ago

Overselling is only the tip of the iceberg. The real problem is that a lot of managers base their decision to introduce language models into business processes on cutting edge Pro edition demos, but what is, of course, actually used in production is some cheap Nano/Flash/Mini version.