top | item 45516346

Samsung released a 7M model that achieved 45% on ARC-AGI-1

34 points| chintler | 5 months ago |arxiv.org | reply

12 comments

[+] magicalhippo|5 months ago|reply

Discussed here:

https://news.ycombinator.com/item?id=45506268 Less is more: Recursive reasoning with tiny networks (54 comments)

[+] firefax|5 months ago|reply

Can someone elaborate on the meaning of "7m model"?

I'm new to AI, and had an LLM spit out an explanation of why some of the "local" models don't work in Ollama on my Air, but... I don't know how accurate the AI is, heh.

It's my understanding most models are more like 1-30b (as in Billion)

[+] magicalhippo|5 months ago|reply

They have just four small layers, rather than several dozen large layers. Off the top of my head, Gemma 3 27B has 63 layers or so. They're also larger since it has a much larger number of embedding dimensions.

Hence they end up with ~7 million weights or parameters, rather than billions.

[+] p1esk|5 months ago|reply

7 million parameters

[+] maccam912|5 months ago|reply

related/duplicate https://news.ycombinator.com/item?id=45506268 I think.

[+] unknown|5 months ago|reply

[deleted]

[+] zwischenzug|5 months ago|reply

Released where?

[+] iblacksand|5 months ago|reply

https://github.com/SamsungSAILMontreal/TinyRecursiveModels

[+] ripped_britches|5 months ago|reply

Wow this is legitimately nuts

[+] koakuma-chan|5 months ago|reply

Why?

[+] byyoung3|5 months ago|reply

seems like they just stole the original HRM (just glanced at this though)