Embodied AI Reading Notes (@EmbodiedAIRead)

2026-01-01 | โค๏ธ 112 | ๐Ÿ” 21


EgoX: Egocentric Video Generation from a Single Exocentric Video

Project: https://keh0t0.github.io/EgoX/ Paper: https://arxiv.org/pdf/2512.08269

This paper presents a method for generating egocentric videos from a single exocentric video with high geometric coherence and visual fidelity.

  • Why this is important: consistent egocentric video generation opens up new possibilities for robot learning. If egocentric videos can be generated with high quality, diversity, and quantity, synthetic data becomes a powerful complement to real-world data.

  • Problem definition: given an exocentric video sequence and egocentric camera poses, the goal is to generate a corresponding egocentric video sequence that depicts the same scene from a first-person viewpoint.

  • Challenge: preserve the visible content in the exocentric view while synthesizing unseen regions in a geometrically consistent and realistic manner.

  • Method: (1) The exocentric sequence is first lifted into a 3D point cloud and rendered from the target egocentric viewpoint, producing an egocentric prior video. (2) This prior video and the original exocentric video are then fed as inputs to a LoRA-adapted pretrained video diffusion model that generates the egocentric video. (3) Geometry-guided self-attention in the DiT adaptively focuses on view-consistent regions and enhances feature coherence across perspectives. (A rough sketch of the geometric prior step follows below.)
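A minimal sketch of step (1) under stated assumptions: lifting one exocentric frame to a world-space point cloud via a per-frame depth map and known camera parameters, then splat-rendering it into the egocentric camera to obtain a prior frame plus a visibility mask. All names, shapes, and the simple far-to-near painter's splat are illustrative assumptions, not the paper's actual implementation; the diffusion and attention stages (2)-(3) are only referenced in comments.

```python
# Illustrative sketch, not EgoX's actual code: unproject an exocentric frame to
# 3D using its depth map, then reproject into the egocentric view as a prior.
import torch


def unproject(depth, rgb, K, cam2world):
    """Lift an H x W depth map (exocentric view) to world-space 3D points with colors."""
    H, W = depth.shape
    ys, xs = torch.meshgrid(torch.arange(H), torch.arange(W), indexing="ij")
    pix = torch.stack([xs, ys, torch.ones_like(xs)], dim=-1).float()   # (H, W, 3) homogeneous pixels
    rays = pix @ torch.linalg.inv(K).T                                 # back-project to camera rays
    pts_cam = rays * depth.unsqueeze(-1)                               # scale rays by depth
    pts_h = torch.cat([pts_cam, torch.ones_like(depth).unsqueeze(-1)], dim=-1)
    pts_world = (pts_h.reshape(-1, 4) @ cam2world.T)[:, :3]            # (H*W, 3) world points
    return pts_world, rgb.reshape(-1, 3)


def render_prior(pts_world, colors, K, world2ego, H, W):
    """Project the point cloud into the egocentric camera.

    Uses a far-to-near painter's splat so nearer points overwrite farther ones.
    Returns the prior RGB frame and a binary visibility mask of covered pixels.
    """
    pts_h = torch.cat([pts_world, torch.ones(len(pts_world), 1)], dim=-1)
    pts_ego = pts_h @ world2ego.T                                      # (N, 4) in ego camera frame
    z = pts_ego[:, 2]
    valid = z > 1e-3                                                   # keep points in front of the camera
    uvw = pts_ego[valid, :3] @ K.T
    u = (uvw[:, 0] / uvw[:, 2]).round().long()
    v = (uvw[:, 1] / uvw[:, 2]).round().long()
    inside = (u >= 0) & (u < W) & (v >= 0) & (v < H)
    u, v, zf, col = u[inside], v[inside], z[valid][inside], colors[valid][inside]

    frame = torch.zeros(H, W, 3)
    mask = torch.zeros(H, W, dtype=torch.bool)
    order = torch.argsort(zf, descending=True)                         # far first, near last
    for i in order:
        frame[v[i], u[i]] = col[i]
        mask[v[i], u[i]] = True
    return frame, mask


# Dummy example: one 64x64 exocentric frame with constant depth, hypothetical
# intrinsics, and a slightly shifted egocentric camera.
H, W = 64, 64
K = torch.tensor([[60.0, 0, 32], [0, 60.0, 32], [0, 0, 1]])
depth = torch.ones(H, W) * 2.0
rgb = torch.rand(H, W, 3)
exo_cam2world = torch.eye(4)
ego_world2cam = torch.eye(4)
ego_world2cam[0, 3] = 0.2                                              # small lateral offset of the ego camera

pts, cols = unproject(depth, rgb, K, exo_cam2world)
prior_frame, visibility = render_prior(pts, cols, K, ego_world2cam, H, W)
# Repeating this per frame yields the egocentric prior video; it and the original
# exocentric video would then condition the LoRA-adapted video diffusion model,
# with the visibility mask informing the geometry-guided self-attention (step 3).
```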

๐Ÿ”— ์›๋ณธ ๋งํฌ

๋ฏธ๋””์–ด

image


Auto-generated - needs manual review

Tags

3D Rendering Robotics AI-ML GenAI