top | item 44258213

Seedance 1.0

219 points| matallo | 8 months ago |seed.bytedance.com

119 comments

order

robviren|8 months ago

I look forward to a day when capabilities like this are trivial and boring to the average person. When my phone (locally) will be able to generate a fully voice acted 24 episode series anime on a whim for a meme with my group chat. It's astounding what we can do now, but will be completely ignorable before we know it, which is equally wild.

ljm|8 months ago

Literally nobody will give a fuck about a 24 episode series that exists because you spent a few seconds writing a prompt.

AI doesn’t increase the value of content, it makes it meaningless by destroying scarcity.

Tea. Earl Grey. Hot.

thebestmoshe|8 months ago

Who is going to spend time watching those episodes if content is so easy to make? Everyone will be busy watching their own generated content.

echelon|8 months ago

More content will be made in a single month than all of human history up to this point.

No more Disney-fication, no more Marvel / Star Wars "mass media slop". We'll have media that caters to people's long-tail interests. If you have a passion for Egyptology and Atlantis, you'll be able to watch a steampunk adventure about the Egyptians waging war with the Atlanteans. But perhaps with the serious tone of "The Wire". That would never have been greenlit before, but it'll soon be possible.

Good creators will arise just like good indie music, indie manga/comic, and indie game creators. Discovery will be the problem to solve for creators. There will be an abundance of talent that is finally able to create their vision rather than nepotism their way into one of 500 limited annual roles of autonomy.

Small creators who grow large like VivziePop [1] and PsychicPebbles [2] will be the model for the future of content. They start small on YouTube, grow large, and eventually have their own large-scale distribution and franchises.

The creative world is about to get orders of magnitude better. Not 2x, not 10x, but easily 1,000x.

I hate most movies and tv shows, but love the medium. The problem is most content produced just isn't my vibe. I like super artsy stuff, but also have particular tastes. That's going to change dramatically. Stuff will start fitting the shape of my interest graph.

I'm so excited.

[1] https://en.wikipedia.org/wiki/Vivienne_Medrano

[2] https://en.wikipedia.org/wiki/Zach_Hadel

Bombthecat|8 months ago

Me too!

Can't wait to create shadowrun movies for example :)

thebestmoshe|8 months ago

The future is something like the TikTok algorithm, but generated on the fly.

As you scroll, it learns what you like and generates more videos.

pizzathyme|8 months ago

With enough context fed into the model of what you react to, the content will be so mesmerizing that you won't be able to look away

This is chilling and also seems inevitable long term

Intralexical|8 months ago

I think this is an unfortunate misunderstanding of why people like and use social media.

ChatGPT can already generate endless "comments". And yet you're here.

ninetyninenine|8 months ago

That's the near future. Look farther down the line and it's netflix. Keep scrolling and it generates entire movies and shows based on what you like.

Probably before that though we'll see AI movies pre-generated before we see them generated on the fly during scrolling.

arbll|8 months ago

I think it will also try to influence what you like to maximize engagement unfortunately...

bemmu|8 months ago

Later on a "live mode" which is realtime generated content, guided by your voice. Netflix could also have this as a feature.

layer8|8 months ago

Will it learn that I don’t like ads?

pointlessone|8 months ago

Some of the shots are impressive but… Even among these hand-picked examples there’s a plenty of unnatural movement. And it seems like it was trained on the most hyperactive subset of tiktok as it apparently can’t hold a scene for more than 5 seconds.

tecleandor|8 months ago

While it pulls some pretty difficult things, it seems to struggle with other *seemingly* simple ones.

The piano in the beginning or the photo camera used by the photographer has "AI text" written on it. The old man with the beret in the cafe goes through his beret with his hand. The girl on the seaside looking back turns her head too much almost like an owl. The boy-in-a-bike-through-an-ewuropean-city scene ends on a square with an amorphous being in a unicycle under the tree...

GaggiX|8 months ago

Very nice being able to read an actual paper on a powerful text-to-video model.

liuliu|8 months ago

Yeah. It is great. So apparently separating spatial / temporal attention works if you are careful and train with large enough dataset too!

cchance|8 months ago

really coo, but wheres the sound? i'd expect that they'd have built in the sound model since its gonna look like SOTA for video, VEO3 is great for video but the audios what knocks it out of the park

paulluuk|8 months ago

I work on AI solutions for a major video streaming company, and the problem with VEO3 is that it doesn't have any consistency between prompts. E.g. I can not upload a reference image of what a character looks like, and if I say in one video "the old priest bends down" and in the next video "the old priests picks up the coin", the priest will look very different between shots.

Veo3 does support image to video, so what you can do is create an image that is the start of a scene, and then use that to generate the actual scene. Unfortunately, Veo3 is really bad at this. I expect this will improve over time.

Although I'm not super excited about this Seedance model personally, I do really like that it focuses on consistency between shots. I hope this puts pressure on increased performance from Veo3 in that regard.

silverlight|8 months ago

Is this available somewhere to use?

ivape|8 months ago

This is obviously going straight to TikTok. The big issue is it's going to open the flood gates on their own platform.

Anyway, if everyone wants to be a content creator, why not charge them for the privilege of that desire? A content creator will forever need AI-generated something. So now we move from "you get to post your content for free" over to "you get to now pay us through this AI-gateway to post your content".

electriclove|8 months ago

Why do all the examples have a large circle in them?

bytesandbots|8 months ago

Has the realism of AI already caught on to that of animated CGI movies?

I assume that an expert in CGI can point out obvious flaws in these outputs. But I wonder if it is possible to fix those details by prompting it to change only specific segments.

There is also the question of how much compute/money they are spending per second of output, compared to a high-budget Hollywood CGI.

layer8|8 months ago

Given how bad regular (non-animated) CGI often is in Hollywood movies nowadays, I don’t think the bar is that high.

Change management will indeed be interesting.

smusamashah|8 months ago

There is something in motion heavy videos that is making me nauseas/sick in my stomach. Last time I felt this was with first Sora release. It's not as bad as Sora, but its there. Veo 3 didn't gave me these feelings or may be I haven't seen its motion heavy samples.

Does anyone else feels same looking at motion heavy samples of Seedance?

benzible|8 months ago

"Old man" doesn't look that old to me (guess that means I'm old!)

citizenpaul|8 months ago

Am i the only one that finds these and all AI video rather underwhelming and more importantly subjectivly not that good? Sure the image quality is nice but nothing about them ever stick with me like countless small projects people make. I just see them and think, thats cool then forget about it forever.

It seems to me they all have subtle "badness" that makes them essentially useless. Quick example on this is the video of "guy sitting on the subway chair" has people passing in front of him even as the camera is about an inch from his face. Unless you are half asleep it is disconnecting in a way that my brain background processing says this is nonsense and I cannot care about it. It seems all AI video has this issue at this time in essentially infinite ways. Meaning I don't think there is any near term solution that will make them viable for production use.

Maybe I'm just old and not suited to this new hyper ADHD style media world.

burkaman|8 months ago

Like every AI launch demo I've ever seen, the results are unbelievably high quality, but if you take a second to read the prompts they never quite match. Here basically every single example is ignoring a portion of the prompt; sometimes the camera directions, sometimes the atmospheric description, sometimes making up very distinct elements that were not mentioned at all. People talk about "AI slop" because these models are really good when you just want "content" and you don't really care exactly what it looks like, but if you are trying to produce something specific, which you are in every real-world use case I can think of, it is very frustrating and often impossible to get there.

bieber|8 months ago

As a Chinese,I am proud of ByteDance. It make the China's AI industry at the top of the world. Although we have been banned by USA

mynti|8 months ago

you guys keep making everyone else kind of look bad. all eyes on china soon/now for AI

sergiotapia|8 months ago

Hollywood is cooked isn't it?

ramon156|8 months ago

Its been cooked for ~2-3 years now

DavidVoid|8 months ago

Man I hate when sites hijack the page up/down keys.

cchance|8 months ago

Sadly another closed model that will never go open weights i'd guess

energywut|8 months ago

We're so fucked, as a species.

People are already way too easy to get to believe conspiracy theories. Shit like Pizzagate or whatever is only going to get more common when bad people start making, "and look, here's the video proof!"

And we've already got Tiktok and Youtube Shorts just pumping the dopamine centers in the brain for short form content. Generating shit you like dynamically is going to be an addictive nightmare. The moment it gets monetized we're going to see the equivalent of slot machines pumped at us from every channel -- flashing lights and emotional tugs to get us to part with our valuable money or attention.

And that's to say nothing of the impact these tools have on artists and creative people or the costs to train and deploy these tools.

We're already seeing it today. The amount of 'footage' about LA right now that's showing some sort of war zone that is clearly AI generated, but being consumed as if it was real is staggering.

dachris|8 months ago

Yes, perfect AI content has multiple issues, that need to be addressed differently

- treating certain content in the same way that drugs are treated. Lots of countries are already moving towards age restrictions for social media.

- some kind of hardware-provided signatures for images and video, anything else must be assumed to be generated

Will be interesting for kids growing up - the peer pressure is now already very high to have smartphones, to be on Whatsapp, Instagram, TikTok, this will only get worse.

Maybe if I have kids I will found some Amish-like community with only 90's tech (only half joking).

bobxmax|8 months ago

That's been happening since Photoshop. Describing this as some apocalyptic event is absurd.

jadbox|8 months ago

We can only hope that people become aware that the Internet is a bullshit-machine and will only pull their news information from journalists, but I know this is wishful thinking.

Intralexical|8 months ago

We've basically flooded the information space with r-strategists.

In evolution, rapid reproduction gives an advantage to spamming low-quality offspring [1], and rapid selection without agglomeration [2,3] incentivizes antisocial behavior.

Ideas spread, mutate, and evolve just like animals [4]. So when the Internet made it free for anyone to transmit information to millions of people instantly, trustworthy information sources [5] and prosocial cultural values started dying [6], as literally the worst and craziest people become dominant [7,8,9,10,11].

...Presumably "AI" is going to make this even worse, and immeasurably so.

---

1. https://en.wikipedia.org/wiki/R/K_selection_theory

2. https://journals.plos.org/plosone/article?id=10.1371/journal...

3. https://en.wikipedia.org/wiki/Context_collapse

4. https://en.wikipedia.org/wiki/Memetics

5. https://en.wikipedia.org/wiki/Decline_of_newspapers

6. https://theweek.com/culture-life/third-places-disappearing

7. https://globalnews.ca/news/1157137/internet-trolls-are-sadis...

8. https://www.engadget.com/2018-03-19-study-shows-distribution...

9. https://old.reddit.com/r/slatestarcodex/comments/9rvroo/most...

10. https://www.ipr.northwestern.edu/news/2023/why-are-online-po...

11. https://en.wikipedia.org/wiki/Kakistocracy

yb6677|8 months ago

[deleted]

gherard5555|8 months ago

Can't wait for infinite ai generated video slop

bufferoverflow|8 months ago

Decent 1080p quality. Not bluray level, but getting close. Definitely ahead of every other video generator.

Video production just got a lot cheaper and requires very few skills. This is basically destroying the creative video production industry (ads, product videography, youtube content of all kinds) and probably VFX industry as well.

echelon|8 months ago

It's beating Google Veo 3 in the model arena:

https://artificialanalysis.ai/text-to-video/arena?tab=leader...

They've been running tests for weeks under the covert name "Unicorn" and just renamed the model to Seedance a few days ago.

edit: I'm not sure why I'm being downvoted for this, except perhaps not liking the ByteDance angle.

China produces incredibly good video models and have been in the market lead for at least a year now. All of the top video models, save for Veo 3, are from China.

In fact, the only open source video models of note are all non-American (mostly Chinese, and one Israeli model).

tartoran|8 months ago

I feel so bad for the next generation who will never have watched man made movies, they will not be able to tell whether something is junk or not because there will be no baseline.

pkkkzip|8 months ago

I dont think they care as long as the content is good. Even memes popular at that demographic are AI generated today.

Permik|8 months ago

Only light skinned people on the video examples. Ethnic diversity and accuracy used to be a problem with the models of the past. I wonder how the model would excel at prompts grokking at that.

Leary|8 months ago

0:10 into the video, that's light skinned to you?!

echelon|8 months ago

This model is from China. It isn't even being made available internationally (yet).