MrNeRF (@janusch_patas)

2025-07-18 | ❤️ 452 | 🔁 57

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

Contributions:

• We introduce Diffuman4D, a novel diffusion model that generates spatio-temporally consistent and high-resolution (1024p) human videos from sparse-view video inputs.

• We propose a sliding iterative denoising mechanism that enhances both the spatial and temporal consistency of generated long-term videos while maintaining efficient inference.

• We design a human pose conditioning scheme to enhance the appearance quality and motion accuracy of generated human videos.

• We plan to release our processed version of the DNA-Rendering dataset, which we believe will benefit future research in this area.
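The post names the sliding iterative denoising mechanism without describing it, so the following is only a toy sketch of the general idea of denoising a long sequence of latents through overlapping windows. It is not the authors' implementation. The function names, the window, stride, and round counts, the wrap-around indexing, and the mean-pulling "denoiser" are all assumptions made for illustration; a real system would call a trained diffusion model at that step.

```python
import numpy as np


def toy_denoise_step(window: np.ndarray, strength: float = 0.5) -> np.ndarray:
    """Hypothetical stand-in for a diffusion denoiser.

    Pulls every latent in the window toward the window mean, so that
    latents sharing a window become more consistent with each other.
    """
    context = window.mean(axis=0, keepdims=True)
    return window + strength * (context - window)


def sliding_iterative_denoise(latents: np.ndarray, window: int = 4,
                              stride: int = 2, rounds: int = 3):
    """Slide an overlapping window over the sequence for several rounds.

    latents: array of shape (n, d), one row per frame or view (assumed layout).
    Returns the updated latents and how often each row was visited.
    """
    n = latents.shape[0]
    assert window <= n, "window must not exceed the sequence length"
    latents = latents.copy()          # leave the caller's array untouched
    counts = np.zeros(n, dtype=int)
    for _ in range(rounds):
        for start in range(0, n, stride):
            # Wrap around the end so every row gets the same coverage
            # (an assumption; it suits a circular camera arrangement).
            idx = np.arange(start, start + window) % n
            latents[idx] = toy_denoise_step(latents[idx])
            counts[idx] += 1
    return latents, counts


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noisy = rng.normal(size=(8, 3))
    cleaned, visits = sliding_iterative_denoise(noisy)
    print("visits per latent:", visits)
    print("spread before/after:", noisy.std(), cleaned.std())
```

Because neighbouring windows overlap, information from one window carries into the next, which is the intuition behind keeping a long sequence consistent without processing all of it at once.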

See similar notes in domain-rendering, domain-genai
