Not to belittle this or anything (it does look good and shows promise), but it feels like they somehow generate several consistent (but discrete) views of a given world, then feed all that to the good old pose estimation + Gaussian splatting workflow. Whenever you leave the generated area (which isn't exactly huge in the few I tested) you get tell-tale signs of GS.
xg15|3 months ago
embedding-shape|3 months ago
The interior scenes look and walk great, but any scenes with/in exteriors seem kind of bad.
kkukshtel|3 months ago