📚 세현's Vault

🌍 도메인

🔮3D-Vision
🎨Rendering
🤖Robotics
🧠LLM
👁️VLM
🎬GenAI
🥽XR
🎮Simulation
🛠️Dev-Tools
💰Crypto
📈Finance
📋Productivity
📦기타

📄 Papers

📚전체 논문172

Home

❯

bookmarks

❯

RayRoPE: Projective Ray Positional Encoding for Multi view Attention

RayRoPE: Projective Ray Positional Encoding for Multi-view Attention

2026년 2월 05일1 min read

3D-Vision
multi-view-transformer

Shubham Tulsiani (@shubhtuls)

2026-02-05 | ❤️ 730 | 🔁 91

[1/N] Rotary Position Embeddings (RoPE) are ubiquitous across transformers that process tokens from 1D, 2D, or 3D grids e.g. language, images, or videos. Our RayRoPE formulation extends these to multi-view transformers. Paper and code: https://t.co/abVobLRJxq https://t.co/cYhczUqrGc

미디어

video thumbnail

📚 세현's Vault

🌍 도메인

📄 Papers

RayRoPE: Projective Ray Positional Encoding for Multi-view Attention

Shubham Tulsiani (@shubhtuls)

미디어

Tags

그래프 뷰

목차