That's an idea we've thought about. However, we think the open source community has already created a very impressive set of language- or region-specific finetunes [1] [2]. Also, every language carries a lot of cultural context and nuance that we don't have the capacity to cover sufficiently. So for v3 we focused on creating the best foundational multilingual model.
[1] https://huggingface.co/aiplanet/buddhi-indic
[2] https://ai.google.dev/gemma/gemmaverse/sealion
jjani|11 months ago
Happy to elaborate if there's a way to get in touch, in case the team isn't aware of this.
mdp2021|11 months ago
alekandreev|11 months ago
Workaccount2|11 months ago
It would also kind of suck for non-English speakers, because it will just be another feather in the cap of "English eats the world".