jascii | 2 years ago

I'm having a hard time coming up with a non-nefarious use case for this.

bovermyer|2 years ago

I'd get a kick out of having my own blog posts read to me in James Earl Jones's voice.

Or, heck, my own voice. Though it'd be surreal to hear not-me-but-me saying things I've never said.

woodrowbarlow|2 years ago

Even this is ethically questionable. James Earl Jones's voice is his livelihood.

mahmoudfelfel|2 years ago

We have been seeing some of these genuine use cases: YouTube creators, audiobooks, e-learning videos, podcasts, commercials, dubbing, and gaming.

tgv|2 years ago

BS. That could just be done without imitating someone's voice.

ksrm|2 years ago

No one is going to listen to an audiobook made with this. It's still fundamentally just TTS.

zanderwohl|2 years ago

I am toying with building virtual puppet software in the style of watchmeforever. I have a number of voices I do for the stage and DnD, and I would be willing to train a few models on them so I could give my puppets unique voices.

rockemsockem|2 years ago

Anything written can be listened to with this tech. Any news article, any short story, a draft of a piece of writing you're working on. There is too much text for human beings to read it all.

scrollaway|2 years ago

> There is too much text for human beings to read it all.

So your logic is that all that text should be audio and people will consume more? Because I've got news for you: reading is faster than listening.

vincnetas|2 years ago

And all AI bots are here to generate even more text. :( We will need to rethink and reevaluate lots of things that we are used to.

erichocean|2 years ago

I'm using this kind of technology for temporary voice tracks in animated shorts.

I'd really like something like Img2Img for voices so I can translate a performance to an arbitrary (synthetic) voice.

nullsense|2 years ago

Tortoise TTS can do this. You just pass it your example as a conditioning latent.

atentaten|2 years ago

Generating audio for an audiobook: if an author could speak for 20 minutes and then generate audio for an entire book from the book's text and the model, I think that would be very useful.

sva_|2 years ago

20 seconds*

jeroenhd|2 years ago

Voice generator tech has created some decent surreal memes (like audio recordings of Biden, Obama, and Trump playing video games together).

Outside of memes or maybe the occasional well-intentioned prank, I really can't think of anything either.

Rubinsalamander|2 years ago

Massively reducing costs for voice-over in video games. This should even make it feasible to create mods with audio, which would be great :)