(no title)

baldgeek | 2 years ago

Two clicks from the posted link ("Read Paper", then the "Code, Data and Media" tab) will get you the dataset used (https://paperswithcode.com/dataset/ucf101).

gs17|2 years ago

That's not the dataset used for training. From the paper:

>We train our T2V model on a dataset containing 30M videos along with their text caption. [...] We evaluate our model on a collection of 113 text prompts describing diverse objects and scenes. The prompt list consists of 18 prompts assembled by us and 95 prompts used by prior works (Singer et al., 2022; Ho et al., 2022a; Blattmann et al., 2023b) (see App. B). Additionally, we employ a zero-shot evaluation protocol on the UCF101 dataset

sjwhevvvvvsj|2 years ago

Well, in the AI/ML era maybe “models or gtfo” is better. Training data is just Common Crawl for half of these LMs.