Saumya Saxena (@saxena_saumya)
2024-12-29 | ❤️ 146 | 🔁 29
Can 3D scene graphs act as effective online memory for solving EQA tasks in⚡️real-time?
Presenting GraphEQA🤖, a framework for grounding Vision Language Models using multimodal memory for real-time embodied question answering. https://x.com/saxena_saumya/status/1873408926743679206/photo/1
미디어
![]()