Haoran Geng (@HaoranGeng2)

2026-01-06 | ❤️ 289 | 🔁 31


This might be my “aha moment” of 2025:

With our new robotics foundation model, Large Video Planner, we train a robot planner from large-scale video data. It works so well that we can use it directly for robot planning.

Two moments really blew my mind: First: right after our model training, I fed in an image of my hand and my MacBook and asked it to close the laptop. When the Apple logo appeared exactly as the lid came down, I couldn’t help but feel impressed (and excited). Second demo: picking up the brush. Check the 3D consistency. Even the brush shadow is remarkably accurate, and it can even infer what the Franka arm (at the corner) should look like.

Media

image


Auto-generated - needs manual review

Tags

3D Rendering Robotics AI-ML