top | item 33024194

(no title)

agitator | 3 years ago

What's mind blowing is that you can extrapolate where this is going to go. Eventually, you will be able to generate full movie scenes from descriptions.

What's interesting to me is how this is so similar to human imagination. Give me a description and I will fabricate the visuals in my mind. Some aspects will be detailed, others will be vague, or unimportant. Crazy to see how fast AI is progressing. Machines are approaching the ability to interpret and visualize text in the same way humans can.

This also fascinates me as a form of compression. You can transmit concepts and descriptions without transmitting pixel data, and the visuals can be generated onsite. Wonder if there is some practical application for this.

discuss

order

treis|3 years ago

IMHO this particular avenue is a dead end. It's an extraordinarily impressive dead end but it's clear that there's no real understanding here. Look at this video of the knight riding a horse:

>https://makeavideo.studio/assets/A_knight_riding_on_a_horse_...

The horse's face is all wrong

The gait is wrong

The interface with the ground & hooves is wrong

The knight's upper body doesn't match with the lower and they're not moving correctly

I think ultimately the right path is something like AI automated Blender. AI creates the models & actions while Blender renders it according to a rules based physics engine.

yreg|3 years ago

Of course there "is no understanding here", but yet it's not all wrong. Somehow it did move the horse's legs roughly correctly (using the proper joints and all), somehow the cape is moving roughly as it should through the air and the knight's body absorbs the force of stomping on the ground…

It doesn't seem that the fundamental inability to understand what is going on in the scene is stopping models of this kind to eventually lead to realistic results.

Same applies to DALL-E and GPT.

numtel|3 years ago

"Don't look where we are right now but imagine where we'll be two more papers down the line" - Two minute papers

giarc|3 years ago

Would be interesting to input some existing screenplays into a future tool like this and see what comes out.

bdickason|3 years ago

Or full 3D scenes that are interactive?