Jiafei Duan (@DJiafei)
2026-02-05 | ❤️ 129 | 🔁 21 | 💬 2
Why do generalist robotic models fail when a cup is moved just two inches to the left? It's not a lack of motor skill; it's an alignment problem. Today, we introduce VLS: Vision-Language Steering of Pretrained Robot Policies, a training-free framework that guides robot behavior in real time.
Check out the project: https://vision-language-steering.github.io/webpage/ 👇🧵 (Watch till the end: VLS runs uncut, steering pretrained policies across long-horizon tasks.)
📌 Original Content
VLS: Steering Pretrained Robot Policies via Vision-Language Models
A training-free framework for inference-time adaptation of frozen generative robot policies.
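To give a rough intuition for what "inference-time adaptation of a frozen policy" can look like, here is a minimal sketch of one common pattern: sample several candidate actions from the frozen generative policy and let a VLM re-rank them against the language instruction, with no gradient updates. All names here (`FrozenPolicy`, `VLMScorer`, `steer`) are hypothetical stand-ins, not the VLS API, and the paper's actual steering mechanism may differ.

```python
# Hedged sketch: best-of-n steering of a frozen generative policy with a
# VLM scorer. Every class/function name is illustrative, not from the paper.
import numpy as np

class FrozenPolicy:
    """Stand-in for a pretrained generative robot policy (weights frozen)."""
    def sample_actions(self, obs: np.ndarray, n: int) -> np.ndarray:
        # A real policy (e.g. a diffusion or VLA model) would condition on obs;
        # here we just draw n random 7-DoF action candidates.
        return np.random.uniform(-1.0, 1.0, size=(n, 7))

class VLMScorer:
    """Stand-in for a vision-language model that scores candidates."""
    def score(self, image: np.ndarray, instruction: str,
              actions: np.ndarray) -> np.ndarray:
        # A real scorer would ask the VLM how well each candidate's predicted
        # outcome matches the instruction; here we return random scores.
        return np.random.rand(len(actions))

def steer(policy: FrozenPolicy, scorer: VLMScorer, image: np.ndarray,
          obs: np.ndarray, instruction: str, n_candidates: int = 16) -> np.ndarray:
    """Sample candidates from the frozen policy; return the VLM-preferred one."""
    candidates = policy.sample_actions(obs, n_candidates)
    scores = scorer.score(image, instruction, candidates)
    return candidates[int(np.argmax(scores))]  # training-free: no weight updates

if __name__ == "__main__":
    action = steer(FrozenPolicy(), VLMScorer(),
                   image=np.zeros((224, 224, 3)), obs=np.zeros(10),
                   instruction="pick up the cup on the left")
    print(action)
```

The key property this sketch shares with the announcement is that the policy itself never changes: the VLM only influences which of the policy's own samples gets executed, which is why the approach can adapt behavior at run time without retraining.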
Media
🎬 Video
🔗 Related
- dynamicvla (Domain: VLM, Robotics/Manipulation)
- first-fully-open-action-reasoning-model-arm-can-think-in-3d- (Domain: VLM, Robotics/Manipulation)
- bringing-foundation-models-to-depth-sensing-defm-is-trained- (Domain: Robotics/Manipulation)
- can-we-bridge-the-sim-to-real-gap-in-complex-manipulation-wi-683188 (Domain: Robotics/Manipulation)
- lingbot-depth-masked-depth-modeling-for-spatial-perception-941785 (Domain: Robotics/Manipulation)