top | item 32466559

(no title)

ML revolutionized translation a long time ago and the demand and pay for translators went down. It also used the works of thousands of humans who translated text and they never saw a dime. Copyright applies to translations so we've already gone through something similar. The same will happen with art and every other medium that ML touches. I have a love/hate relationship with ML because of this. It seems that there will be a painful transition period as many humans are displaced, probably even average coders like me. In a way, it's no different than what a human does but humans can't scale like computers. I hope someone can engineer a new economic system that works for what's coming.

discuss

avian|3 years ago

> ML revolutionized translation a long time ago and the demand and pay for translators went down.

ML translation also caused a noticeable drop in the average quality of translated text in my experience. Companies now ship machine-translated manuals for minor languages that are often little more than gibberish. Human translators weren't perfect - for example, often you could see that the translator didn't know the terminology of the field. But at least the rest of the text was usually intelligible.

If this is where we're heading to with visual art as well I'm not looking forward to the future. Imagine an instruction manual where illustrations are machine-created. Everything looks kind of weird and inconsistent. If you quickly flip through the book it looks like all the important points are shown on the pictures, but looking closely shapes don't match with reality and the details are all wrong. The number of bolts changes from picture to picture. It's all there just to check a mark on someone's list, but doesn't really help you in servicing the thing.

martopix|3 years ago

I disagree with the first part, I remember home appliances made in Asia in the 90s that came with completely ridiculous manuals, possibly translated with a dictionary and little else. Machine translation can really be good if used properly. Of course I wouldn't use it to translate literature.

vintermann|3 years ago

That sort of translation always had a lemon problem. How do you know if the translation to Portuguese is any good, if you don't speak Portuguese and don't know anyone who does? You can pay the expensive translator company, or the dodgy cheaper translator company that claims to be equally good.

Did you choose the responsible expensive option? Sucks, because they have been competing with the bad/automated translators so long that they squeeze their human translators so hard, they produce bad translations anyway.

Also, it didn't help that you didn't give them any context, but just sent them one sentence at a time, extracted from the strings in your program. Because we've all done that, haven't we?

MonkeyMalarky|3 years ago

A DALL-E generated IKEA style manual for a nonexistent piece of furniture would be absolutely hilarious.

A4ET8a8uTh0|3 years ago

Hah. Yes. And as an amusing result, emails and chats are filled with errors resulting from automated translation. The service was supposed to be a crutch, but it became widely adopted and lowered the standards everywhere.

Anecdotally, I now check my email, because Outlook likes to change what I wrote.

shadowgovt|3 years ago

It's honestly pretty fascinating to watch perfect be the enemy of the good in this space. As a result of the birth of cheap-to-free machine translation of human language, the space of things available on the global market has become incredibly vast, because the good-enough stab at translation is often good enough for a dedicated hacker to make the thing work anyway.

tkgally|3 years ago

Former professional translator here (Japanese-to-English, 1986 to 2005). I have very mixed feelings about MT.

On the one hand, as you and others point out, rates for human translation have been hurt, and the quality of a lot of MT-enabled translation visible to the public is mediocre to poor.

On the other hand, MT is enabling communication among people that was not possible or practical before. A small start-up is able to find customers and suppliers in countries with different languages. People with common interests but diverse languages are able to chat and share ideas on social media. A person receives an e-mail from a long-lost relative in a language the recipient doesn’t know; MT enables them to correspond and eventually meet up. Little of that communication would be taking place if a human translator had to be found and paid for each interaction.

sinity|3 years ago

> On the other hand, MT is enabling communication among people that was not possible or practical before. A small start-up is able to find customers and suppliers in countries with different languages. People with common interests but diverse languages are able to chat and share ideas on social media. A person receives an e-mail from a long-lost relative in a language the recipient doesn’t know; MT enables them to correspond and eventually meet up. Little of that communication would be taking place if a human translator had to be found and paid for each interaction.

Yep. I translated tens of KWords between English and Polish by now - thanks to Deepl, which speeds this up by at least an order of magnitude (and the end result is probably better too).

I even planned to translate a book only available in Polish ([this one](https://en.wikipedia.org/wiki/Perfect_Imperfection)) - I translated about 3% already, but unfortunately the author didn't consent to sharing the translation anywhere - which is annoying, because original (Polish) text is freely available on libgen. And English version doesn't exist. And the book is 18 years old at this point. So the potentially lost profits are negligible or nil. Meh.

OmarAssadi|3 years ago

Good points!

As an example, as a hobby, I am interested in things like anime video compression and such. Two of the most essential sites for me in terms of resources and information have been almost entirely Chinese. And while they appreciate and respect the international users, at the end of the day, they can’t translate each and every single forum post into English or any other language that someone may need.

And also unfortunately, I don’t have the time yet to invest in learning the language enough to use those sites without assistance. Instead, I rely heavily on things like Google Translate, Yandex, and DeepL. They’ve been totally essential to the point to where a major pain point for me with my preferred browser (FireFox) has been a lack of native MT support (until very recently, which has been nice for Russian and stuff, but I’m still waiting for Chinese support in Bergamot).

Similarly, while I studied in Russia for a while, I’m not a native speaker, and my skills aren’t what they used to be now that I’ve been out of the country for a while. There is a lot of interesting stuff on the Russian internet (Habr, RuTracker, various articles, etc.) that I only really have the level of convenient access to that I do because of MT.

I don’t think it’s all sunshine and rainbows, certainly, as with stuff like DALL-E 2, but it definitely provides a lot of people real value and happiness. Hopefully, humanity will figure out how best to balance the positives and negatives of AI services like these.

I see that both sides (lol, sorry) of this often take pretty hardline stances on “art is art; there is no difference” vs. “these are just copy-pasting people’s hardwork and can’t actually create anything themselves”. I think the value is sort of indisputable, but I worry about how it’ll affect people’s livelihoods — e.g., after playing with DALL-E 2, I think it is totally capable of replacing things like stock photos a large percentage of the time (not always, of course, but it definitely can sometimes).

I’m of the mindset that you should be able to remix, adapt, copy, etc, virtually everything in an ideal world. Maybe I am just entitled, I don’t see why I should have to waste the man-hours recreating or reverse engineering something simply because of a license incompatibility, for example.

Similarly, I don’t see why I shouldn’t be able to transform or straight-up copy and distribute someone else’s art if it provides people with happiness. As an example, talking about anime earlier, I think things like fansubs can provide lots of additional value to people by either providing more natural translations or through extensive typesetting that companies tend not to do because it is not “necessary”. For example, those familiar with the show Bakemonogatari are probably aware of the extreme amount of dialogue displayed exclusively visually, with rapid flashing cards and such. While I think the official subtitles don’t get enough credit 99% of the time, that is a prime example where fansubs which go in and actively mask over the text on the screen and replace it with the viewer’s native language is very helpful. However, despite the value these things provide, it is copyright infringement.

Rationally, I understand that the emotional and protective side of people exists, but at the same time, I don’t “understand” it; I want people to use my stuff — I want people to be happy with their limited time on this planet. In that sense, I think the emotional argument for copyright does not appeal to me.

That being said, pragmatically, people need jobs. We live in a society where you must work. A society where you have bills to pay, groceries to buy, and fees for schools and hospitals. Stripping away copyright protections entirely for things like this would hurt a lot of people right now. And I am really curious how this’ll all turn out in court when the majority of these datasets are trained on things that they have no right to be using.

Will we be in a world where you can essentially bypass all copyright protections by throwing everything into TensorFlow before feeding the computer a little prompt? Or will we be in a world where many of these tools are crippled to the point of uselessness? Or will we be in a world where companies just accept that sometimes someone will take them to court and settlements will just be another business expense?

Aside from the economic issues, if I’m going to be writing some Adderall-infused essay right now, I figure I may as well rant about the other two.

The first of those is biases; while I have always been aware of this problem, it was very evident after playing with DALL-E 2 this past week. The datasets seem to be very European-centric (US, Canada, NZ, AU, etc included in that). There were a number of things that I could not get it to generate correctly or would produce wildly different results based on the prompts I fed it.

As relatively minor, silly example, while I had no trouble getting it to produce white-washed movie posters for by asking it for “Netflix live-action adaptation of Dragon Ball Z”, I couldn’t get it to do the same for India and Bollywood. I am sure someone who has spent more time doing “prompt engineering” than I have could possibly get better results, but the point still stands in the sense that it seems to like to produce what Americans & Europeans are more exposed to.

Another example is, just ignorantly guessing, the way things are weighted — e.g., I wanted to have it produce art in the style of shows like Tatami Galaxy, The Night Is Short, Walk On Girl, and things like the album covers for Asian Kung-fu Generation. All of those listed are done by Yusuke Nakamura. However, perhaps rubbing salt in the wound for artists, I was unable to get the results I wanted by asking for things in Yusuke Nakamura’s style. Instead, I had much better results by using “Yuasa Masaaki”, the director of Tatami Galaxy and The Night Is Short, Walk On Girl. My only guess here is just that Yuasa is a more common occurrence in text, at least in terms of the English internet.

In that sense, I worry that these tools will not only replace people, but they’ll reinforce existing cultural and societal biases even more so than we already do on our own now.

Further compounding issues like that is the censorship and filtering on OpenAI; virtually anything political, violent or sexual is not currently allowed (and may never be? I figure no company wants to be associated with the potential PR disasters that come with it, like Microsoft’s Tay). This is extra problematic given how important all of those things are in art. I worry that we will lose artists and have fewer people pushing the boundaries of art.

And ironically, for a society that criticizes countries like China so fervently for censorship, we rely so heavily on giant capitalist companies which self-enforce much of the same censorship, helping further enforce the status quo and restricting marginalized minorities. I am not a fan to say the least, despite understanding the potential for violent speech, propaganda, etc.

Anyway, that’s the end of my long-winded 6 AM phone rant. Apologies for any weird formatting or incoherent thoughts!

tgv|3 years ago

That the translators "never saw a dime" isn't accurate. A large corpus of parallel texts is supplied by the EU, which pays its translators and then publishes the documents for free. Other parallel corpora include manuals. While there's definitely copyright on that, it doesn't lie with the translators.

0x_rs|3 years ago

Machine translation has been a curse for those who enjoy content that's not usually promptly translated and instead sits there for years if not decades. Not uncommon are people asking in broken English how to use certain tools e.g. automated DeepL scripts and releasing their "translations" (or edited machine translations, changing just some of most glaring issues with the text, generally), thus "burning" a release, that is not going to be taken by real translators that require weeks, months of draining work to complete so that context, subtlety and most meaning from the original source is not lost.

It's a race to the bottom. It doesn't sound nice to say, but as things become more accessible it's harder and harder to filter out genuine content from look-alikes, asset flips and low-effort automations, this may be unfamiliar to anybody looking at Steam releases for example. I wonder how it'll turn out for artwork. I want to be optimistic and think of it as an additional tool to artists or people that want to create new things rather than a displacement wave with its significantly smaller costs.

kmeisthax|3 years ago

I can concur; there was a scandal within the fan translation scene for PC-98 games when it came out that at least one high-profile "translator" was just lightly-editing machine translated output to sound correct. Problem is, there are all sorts of mistakes you can make in translation that will not sound incorrect at all unless you speak both languages.

I must ask what you mean by "burning" a release, though. If we're talking about fan translations, stuff gets retranslated all the time. And most of the stuff that gets fan translated is far too niche to justify a commercial release to begin with. If something is popular enough to get an official release, the fan translations either get taken down or disappear on their own.

trention|3 years ago

>ML revolutionized translation a long time ago and the demand and pay for translators went down.

This is incorrect.

Translators employed:

2005 - 29240

2021 - 52170

Yearly salary:

2005 - 38300

2021 - 52170

Source - BLS (https://www.bls.gov/oes/current/oes_nat.htm)

Employment growth was ~79% on a US population growth of ~13.5%.

38300 2005 $ equal 52854 2021$.

So pretty much demand for translators exploded while real wages stayed flat.

kevingadd|3 years ago

Translation is not exclusively a salaried job. Most translators in the fields I'm familiar with (games and manga) are freelance contractors, and most are getting paid by the word/page, not a yearly salary.

ML has depressed the rates those people earn. It's been good for the demand for editors though since they need to pay someone to fix the DeepL output up so they can release it.

matheusmoreira|3 years ago

> I hope someone can engineer a new economic system that works for what's coming.

What's coming is post-scarcity for all kinds of intellectual work. There will be nothing to economize: supply of both labor and product will be infinite. I don't want corporations engineering a made up economy where there is none.

trention|3 years ago

>post-scarcity for all kinds of intellectual work

Unfortunately, all intellectuals also need to eat, bathe and clothe - areas where no "post-scarcity" is on the horizon (with climate change, both food and water may become less abundant). So an economic system will need to be invented.

soulofmischief|3 years ago

I predict a new system arises in which styles themselves are copyrightable and eligible for royalties.

Human-sourced styles will become trend, as people seek originality to establish identity in an otherwise indistinguishable world of machine-driven experiences.

bsedlm|3 years ago

I'm wating for the algorithms to automate all my decision making too.

so me in the future will be an algorithm suggesting what should I buy/eat/etc.. and another which will accept/reject what the former suggests.

this way, I don't have to bother with any of this annoying "life" stuff.

/joking

verisimi|3 years ago

Yes, if you were ruler of the world, you're going to have an awfully huge amount of people doing f.a. There are only so many hairdressers, masseurs and plumbers required.

What you need is a kill switch, something to slow down population growth.

Anyway, are you up to date with your mandatory medical treatments?

TaylorAlexander|3 years ago

Community ownership. We need everyone to be part owner in the economy that they rely on. Then productivity gains are good for people even if there is less work to do.

JohnHaugeland|3 years ago

> the demand and pay for translators went down.

Prices on ProZ are basically exactly what they were before Google Translate