Impressive work, but I think the title is misleading. Saying it is “near GPT-4” tends to imply that it outperforms ChatGPT (3.5). It does outperform it on a handful of tasks, but overall it is slightly worse.

That aside, I think this is really cool, and hopefully we keep seeing this kind of improvement in smaller models.

I’m also curious whether we know how many parameters the current ChatGPT 3.5 model has. The API is really cheap, which makes me think it has fewer than the 175B parameters of the largest GPT-3 model.

huijzer|2 years ago

Oh sorry, I didn’t look properly then. I thought it outperformed GPT-3.5 consistently. My bad. Thanks for the correction.