rsasaki0109 (@rsasaki0109)

2025-05-04 | โค๏ธ 157 | ๐Ÿ” 28


SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Models (RSS 2025) https://github.com/SpatialVLA/SpatialVLA A spatial-enhanced vision-language-action model trained on 1.1 Million real robot episodes. ๐Ÿค— purely huggingFace-based, concise code with efficient performance. https://x.com/rsasaki0109/status/1919158640247968240/photo/1

๐Ÿ”— ์›๋ณธ ๋งํฌ

๋ฏธ๋””์–ด

image


Auto-generated - needs manual review

Tags

domain-robotics domain-ai-ml domain-dev-tools domain-visionos