MrNeRF (@janusch_patas)

2024-11-21 | โค๏ธ 123 | ๐Ÿ” 14


Generating 3D-Consistent Videos from Unposed Internet Photos

Abstract (excerpt) A handful of input images serve as keyframes, and our model interpolates between them to simulate a path moving between the cameras.

Given random images, a modelโ€™s ability to capture underlying geometry, recognize scene identity, and relate frames in terms of camera position and orientation reflects a fundamental understanding of 3D structure and scene layout.

However, existing video models such as Luma Dream Machine fail at this task.

We design a self-supervised method that takes advantage of the consistency of videos and variability of multiview internet photos to train a scalable, 3D-aware video model without any 3D annotations such as camera parameters.

โ€ฆ We also show our model benefits applications that enable camera control, such as 3D Gaussian Splatting.

๋ฏธ๋””์–ด

video


Tags

domain-vision-3d