Zoubin Ghahramani (@ZoubinGhahrama1)

2026-01-22 | โค๏ธ 109 | ๐Ÿ” 5 | ๐Ÿ’ฌ 3


Exciting new work on detailed (pixel-level, dense) 3D visual understanding of videos. Based on a scalable feedforward architecture, itโ€™s super fast and super accurate (SOTA). Lots of uses in robotics, AR, world modellingโ€ฆ Check it out!

์ธ์šฉ๋œ ํŠธ์œ—

@GoogleDeepMind: Weโ€™re helping AI to see the 3D world in motion as humans do. ๐ŸŒ

Enter D4RT: a unified model that turns video into 4D representations faster than previous methods - enabling it to understand space and โ€ฆ


์ธ์šฉ ํŠธ์œ—

Google DeepMind (@GoogleDeepMind)

Weโ€™re helping AI to see the 3D world in motion as humans do. ๐ŸŒ

Enter D4RT: a unified model that turns video into 4D representations faster than previous methods - enabling it to understand space and time. This is how it works ๐Ÿงต

์›๋ณธ ํŠธ์œ—

๐ŸŽฌ ์˜์ƒ

Tags

Vision-3D Robotics GenAI AI-ML