hnthrowaway9812's comments

hnthrowaway9812 | 1 year ago | on: Snowflake Arctic Instruct (128x3B MoE), largest open source model

It's a wast if they are mostly all trying the SAME things. Which is mostly what is happening.

I want someone to spend a million on a Chess LLM so we can get a sense of how sophisticated they can get at non-linguistic pattern matching.

I want someone to spend a million on an LLM trained on Python program traces so we can try to teach it cause and effect and "debugging". Maybe it will emulate a Python interpreter and get highly reliable at predicting the outcome of Python code.

etc.

hnthrowaway9812 | 1 year ago | on: Snowflake Arctic Instruct (128x3B MoE), largest open source model

You've nerdsniped me so hard that I had to make an account.

There are DOZENS of orgs releasing foundational models, not "a handful."

Salesforce, EleuthierAI, NVIDIA, Amazon, Stanford, RedPajama, Cohere, Mistral, MosaicML, Yandex, Huawei StabilityLM, ...

https://docs.google.com/spreadsheets/d/1kT4or6b0Fedd-W_jMwYp...

It's completely bonkers and a huge waste of resources. Most of them will see barely any use at all.

page 1