top | item 47051467

(no title)

The funny thing is that Anthropic is the only lab without an open source model

discuss

jack_pp|12 days ago

And you believe the other open source models are a signal for ethics?

Don't have a dog in this fight, haven't done enough research to proclaim any LLM provider as ethical but I pretty much know the reason Meta has an open source model isn't because they're good guys.

bigyabai|12 days ago

> Don't have a dog in this fight,

That's probably why you don't get it, then. Facebook was the primary contributor behind Pytorch, which basically set the stage for early GPT implementations.

For all the issues you might have with Meta's social media, Facebook AI Research Labs have an excellent reputation in the industry and contributed greatly to where we are now. Same goes for Google Brain/DeepMind despite their Google's advertisement monopoly; things aren't ethically black-and-white.

imiric|12 days ago

The strongest signal for ethics is whether the product or company has "open" in its name.

m4rtink|12 days ago

Can those be even called open source if you can't rebuild if from the source yourself?

argee|12 days ago

Even if you can rebuild it, it isn’t necessarily “open source” (see: commons clause).

As far as these model releases, I believe the term is “open weights”.

anonym29|12 days ago

Open weights fulfill a lot of functional the properties of open source, even if not all of them. Consider the classic CIA triad - confidentiality, integrity, and availability. You can achieve all of these to a much greater degree with locally-run open weight models than you can with cloud inference providers.

We may not have the full logic introspection capabilities, the ease of modification (though you can still do some, like fine-tuning), and reproducibility that full source code offers, but open weight models bear more than a passing resemblance to the spirit of open source, even though they're not completely true to form.

j45|12 days ago

They are, at the same time I considered their model more specialized than everyone trying to make a general purpose model.

I would only use it for certain things, and I guess others are finding that useful too.

colordrops|12 days ago

Are any of the models they've released useful or threats to their main models?

vunderba|12 days ago

I use Gemma3 27b [1] daily for document analysis and image classification. While I wouldn't call it a threat it's a very useful multimodal model that'll run even on modest machines.

[1] - https://huggingface.co/google/gemma-3-27b-it

evilduck|12 days ago

Gemma and GPT-OSS are both useful. Neither are threats to their frontier models though.