Haoran Geng (@HaoranGeng2)
2026-01-06 | ❤️ 289 | 🔁 31
This might be my "aha moment" of 2025:
With our new robotics foundation model, Large Video Planner, we train a robot planner from large-scale video data. It works so well that we can use it directly for robot planning.
Two moments really blew my mind. First: right after our model training, I fed in an image of my hand and my MacBook and asked it to close the laptop. When the Apple logo appeared exactly as the lid came down, I couldn't help but feel impressed (and excited). Second demo: picking up the brush. Check the 3D consistency: even the brush shadow is remarkably accurate, and it can even infer what the Franka arm (at the corner) should look like.