Original Tweet
Exciting new work on detailed (pixel-level, dense) 3D visual understanding of videos. Based on a scalable feedforward architecture, itโs super fast and super accurate (SOTA). Lots of uses in robotics, AR, world modellingโฆ Check it out!
๐ Related
- do-we-really-need-an-external-world-model โ ์ฃผ์ : World Model
- we-introduce-egowm-a-video-world-model-that-simulates-eve-1x-humanoid-interactio โ ์ฃผ์ : World Model
- what-does-a-robot-actually-see-when-it-loads-a-truck โ ์ฃผ์ : World Model
- what-if-we-could-train-ai-robots-in-a-perfect-physics-accurate-simulation โ ์ฃผ์ : World Model
- what-if-your-robot-or-car-could-see-depth-more-clearly-than- โ ์ฃผ์ : World Model