rdegges | 7 months ago

Great article. This may be my all-time favorite deep dive post on RAG strategies.

It’s super interesting to me how much processing it takes to make audio/video fully searchable: extracting the audio and video, transcribing the audio, chunking the video into 15-sec scenes and describing them visually, etc.
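
For a rough sense of the chunking step alone, here is a minimal sketch. It assumes fixed 15-second windows, which may not be how the article actually splits scenes, and `scene_windows` is a hypothetical helper name rather than anything taken from the article.

```python
def scene_windows(duration_s: float, scene_len_s: float = 15.0) -> list[tuple[float, float]]:
    """Split a video of duration_s seconds into consecutive (start, end) windows.

    Assumption: scenes are fixed-length windows; the last one may be shorter.
    """
    if duration_s < 0 or scene_len_s <= 0:
        raise ValueError("duration must be >= 0 and scene length must be > 0")

    windows = []
    start = 0.0
    while start < duration_s:
        # Clamp the final window to the end of the video.
        end = min(start + scene_len_s, duration_s)
        windows.append((start, end))
        start = end
    return windows


if __name__ == "__main__":
    # A 40-second clip yields two full windows and one 10-second remainder.
    for start, end in scene_windows(40):
        print(f"{start:.0f}s to {end:.0f}s")
```

Each window would then be passed to whatever produces the visual description, and the matching slice of the transcript can be attached using the same timestamps.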

I wonder if, as a test, you could take the video descriptions, run them as prompts through something like Veo, then stitch the generated clips together into something close to the original. Wild.

mkauffman23 | 7 months ago

I have no idea how accurate the reconstruction would be, but it would make for a wild experiment!