ViPE: Video Pose Engine for 3D Geometric Perception

Contributions: • A robust and efficient framework, ViPE, for estimating camera parameters and dense depth from diverse, in-the-wild videos.

• A system design that integrates the strengths of classical SLAM (efficiency, scalability) and learned models (robustness), with key improvements in efficiency, dynamic object handling, and depth quality over prior work.

• A large-scale dataset of annotated videos, created using ViPE, to facilitate future research in 3D computer vision.

📚 세현's Vault

🌍 도메인

📄 Papers

ViPE: Video Pose Engine for 3D Geometric Perception

그래프 뷰

백링크