top | item 44381640

banana_giraffe | 8 months ago

You can use yt-dlp to get the transcripts. For instance, to grab just the transcript of a video:

    ./yt-dlp --skip-download --write-sub --write-auto-sub --sub-lang en --sub-format json3 <youtube video URL>

You can also feed the same command a playlist or channel URL, and it'll run through and grab the transcript for each video in the playlist or channel.
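
If you want plain text rather than the raw json3 file, here's a rough sketch of flattening one. It assumes the json3 layout is a top-level "events" list where each event may carry a "segs" list of {"utf8": ...} pieces; that's what I'd expect from YouTube, but it isn't a documented format, so check it against a real file. The function name json3_to_text and the sample data are made up for illustration.

```python
import json


def json3_to_text(raw: str) -> str:
    """Flatten a json3 subtitle document into plain text, one line per caption event.

    Assumes the layout {"events": [{"segs": [{"utf8": "..."}]}]}; this is an
    assumption about the format, not a documented contract.
    """
    data = json.loads(raw)
    lines = []
    for event in data.get("events", []):
        # Events without "segs" are treated as metadata-only and skipped.
        text = "".join(seg.get("utf8", "") for seg in event.get("segs", []))
        text = text.strip()
        if text:  # drop events that contain only whitespace or newlines
            lines.append(text)
    return "\n".join(lines)


# Hand-written sample in the assumed layout (not real YouTube output).
sample = json.dumps({
    "events": [
        {"tStartMs": 0, "dDurationMs": 1500,
         "segs": [{"utf8": "hello"}, {"utf8": " world"}]},
        {"tStartMs": 1500, "segs": [{"utf8": "\n"}]},
        {"tStartMs": 2000, "dDurationMs": 1000,
         "segs": [{"utf8": "second line"}]},
        {"tStartMs": 3000, "dDurationMs": 500},
    ]
})

print(json3_to_text(sample))
```

For a real file you'd read whatever subtitle file yt-dlp wrote next to the video and pass its contents to the function.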


fallinditch | 8 months ago

That's cool, thanks for the info. But do you also have to use a rotating proxy to prevent YouTube from blocking your IP address?

banana_giraffe | 8 months ago

Last time I ran this at scale was a couple of months ago, so my information is no doubt out of date, but in my experience, YouTube seems less concerned about transcript downloads than they are when you're grabbing lots of videos.

But that was a couple of months ago, so for all I know they've battened down more hatches since then.