I'm not sure how the depth estimation alone translates into the view synthesis, but the current on-device implementation is definitely not convincing for literally any portrait photograph I have seen.
True stereoscopic captures are convincing statically, but don't provide the parallax.
Good monocular depth estimation is crucial if you want to make a 3D representation from a single image. Ordinarily you have images from several camera poses and can place the Gaussian splats using triangulation; with a single image you have to guess the z position of each one.
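To make the difference concrete, here is a rough sketch, assuming a simple pinhole camera and a rectified stereo pair. The function names and the intrinsics (`fx`, `fy`, `cx`, `cy`, `baseline`) are made up for illustration and are not taken from any particular splatting pipeline. With two views the depth comes from measured disparity; with one view the same back-projection has to run on a guessed depth, so any error in the guess moves the point along the camera ray.

```python
def triangulate_depth(disparity_px, fx, baseline):
    """Depth from a rectified stereo pair: z = fx * baseline / disparity."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return fx * baseline / disparity_px


def backproject(u, v, z, fx, fy, cx, cy):
    """Lift pixel (u, v) with depth z to a 3D point in camera space (pinhole model)."""
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return (x, y, z)


# Hypothetical intrinsics, for illustration only.
fx = fy = 500.0
cx, cy = 320.0, 240.0

# Multi-view case: depth is measured from disparity between two poses.
z_measured = triangulate_depth(disparity_px=25.0, fx=fx, baseline=0.1)
point_stereo = backproject(420, 240, z_measured, fx, fy, cx, cy)

# Single-image case: depth is a guess from a monocular estimator.
# A wrong guess slides the point along the same camera ray.
z_guessed = 3.0
point_mono = backproject(420, 240, z_guessed, fx, fy, cx, cy)

print(point_stereo)
print(point_mono)
```

Both points project back to the same pixel, which is why a bad depth guess looks fine from the original viewpoint and only falls apart once the virtual camera moves.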
supermatt|2 months ago
sorenjan|2 months ago