(no title)
byefruit | 10 months ago
Not to take away from their work but this shouldn't be buried at the bottom of the page - there's a gulf between completely new models and fine-tuning.
byefruit | 10 months ago
Not to take away from their work but this shouldn't be buried at the bottom of the page - there's a gulf between completely new models and fine-tuning.
israrkhan|10 months ago
If they needed to assign their own name to it, at least they could have included the parent (and grant parent) model names in the name.
Just like the name DeepSeek-R1-Distill-Qwen-7B clearly says that it is a distilled Qwen model.
qeternity|10 months ago
Otoh, there aren't many frontier labs that have actually done finetunes.
lumost|10 months ago
How many of the latest databases are postgres forks?
adamkochanowicz|10 months ago
rahimnathwani|10 months ago
GodelNumbering|10 months ago