DailyPapers (@HuggingPapers)
2026-01-30 | ❤️ 265 | 🔁 45 | 💬 5
DynamicVLA
A compact 0.4B Vision-Language-Action model that finally lets robots manipulate moving objects in real-time, closing the perception-execution gap with Continuous Inference and Latent-aware Action Streaming. https://x.com/HuggingPapers/status/2017094507402318169/video/1
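The post names Continuous Inference and Latent-aware Action Streaming but does not describe them. A minimal sketch of the general idea of streaming actions while inference continues, assuming only that the model overlaps policy inference with execution of previously predicted action chunks; `policy_infer`, `execute`, and the loop structure are hypothetical stand-ins, not the paper's method:

```python
import queue
import threading
import time

# Hypothetical stand-ins; the actual DynamicVLA policy and robot interface
# are not described in the post.
def policy_infer(observation):
    """Pretend VLA forward pass: returns a short chunk of actions (4 dummy steps)."""
    time.sleep(0.05)  # simulated inference latency
    return [f"action({observation}, step={i})" for i in range(4)]

def execute(action):
    """Pretend robot controller step."""
    time.sleep(0.01)
    print("executing", action)

def continuous_inference(observations, action_buffer):
    """Inference thread: keeps refilling the action buffer from the latest
    observation instead of blocking execution until each inference call finishes."""
    for obs in observations:
        for a in policy_infer(obs):
            action_buffer.put(a)
    action_buffer.put(None)  # sentinel: no more actions

def main():
    action_buffer = queue.Queue()
    observations = (f"frame_{t}" for t in range(3))
    producer = threading.Thread(target=continuous_inference, args=(observations, action_buffer))
    producer.start()
    # The execution loop drains actions as they arrive, so the arm never idles
    # while the model is still thinking about the next chunk.
    while (action := action_buffer.get()) is not None:
        execute(action)
    producer.join()

if __name__ == "__main__":
    main()
```

This only illustrates overlapping perception and execution; how DynamicVLA conditions streamed actions on latent state is detailed in the paper, not here.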
Related
- first-fully-open-action-reasoning-model-arm-can-think-in-3d- · domain: VLM, Robotics/Manipulation
- lingbot-depth-masked-depth-modeling-for-spatial-perception-941785 · same author: @HuggingPapers
- can-we-bridge-the-sim-to-real-gap-in-complex-manipulation-wi-683188 · domain: Robotics/Manipulation
- the-next-evolution-vla-models · topic: VLA, Robotics/Manipulation
- what-if-your-robot-or-car-could-see-depth-more-clearly-than- · domain: Robotics/Manipulation