top | item 47160557 (no title) nee1r | 4 days ago thanks! the math and architecture of the FDM (no video encoder) is pretty simple, its a regular transformer with next-token predictions but with frames interleaved. discuss order hn newest No comments yet.
No comments yet.