This is important work. And think about African or southeast Asian languages, they are even more screwed. We need to make sure that AI multilingual to avoid total English domination of culture.
Languages are like frameworks, they (slightly) guide your thinking. Think about the same stuff in different languages and you’ll probably get more ideas about it than in one language only.
A lingua franca is useful and probably inevitable. The downside is language and cultural loss. Works in translation are rarely quite as good, especially humor and wordplay. This is why various countries have "local language quota" rules for media; although derided by English speakers and HN, they're a way to keep the local language, culture and identity alive.
Not from an aesthetic sense. I think it's really cool that we have a lot of languages. I'm personally willing to pay a high price in inconvenience to keep that coolness around, although not everyone would.
However I also don't think we will have to. Machine translation and language learning are substitute goods -- the better the former gets, the fewer people who will feel any desire to pursue the latter, because it just won't be that big of a deal to translate between X and Y anyway.
A universal second language for commerce is a fine middle ground, though.
The languages you think in affect your decision making, your creativity, how you perceive the world. If we were restricted to a single language, we’d lose as individuals and as a species.
It is a concern because presumably most people in office jobs are going to need to be able to use these tools, but I am somewhat comforted to know one language that AI systems do not understand well yet because of lack of texts. However, I think that will be short lived.
While I can speak in Portuguese without much issues (except being hard for them to stick to European Portuguese), I've nooticed that sometimes it uses a clear translation of an English expression that does not feel natural in Portuguese at all.
c7DJTLrn|2 years ago
Not saying English is an ideal language, but I'm interested in why you think it shouldn't dominate. Wouldn't a universal language be a good thing?
aziaziazi|2 years ago
pjc50|2 years ago
hiAndrewQuinn|2 years ago
However I also don't think we will have to. Machine translation and language learning are substitute goods -- the better the former gets, the fewer people who will feel any desire to pursue the latter, because it just won't be that big of a deal to translate between X and Y anyway.
A universal second language for commerce is a fine middle ground, though.
latexr|2 years ago
https://www.theguardian.com/science/2023/sep/17/how-learning...
otabdeveloper4|2 years ago
tellarin|2 years ago
- https://txt.cohere.com/aya-multilingual/
- https://aya.for.ai
I'm a contributor to the project and all data and model will be open-sourced.
We're looking for contributors in many languages!
mchaver|2 years ago
loxdalen|2 years ago
rjtavares|2 years ago
asutekku|2 years ago
benatkin|2 years ago
DoingIsLearning|2 years ago
Most countries' classic texts and books are still undigitized sitting in Libraries and public archives.
Also book publishing market and online publishing are proportional to total population, smaller country means less content.
ben_w|2 years ago
I find this very plausible.