Kwang Moo Yi (@kwangmoo_yi)

2025-03-10 | โค๏ธ 193 | ๐Ÿ” 32


Preprint of today: Jing et al., โ€œStereo Any Video: Temporally Consistent Stereo Matchingโ€ โ€” https://tomtomtommi.github.io/StereoAnyVideo/

Not everything has to be diffusion-based: Use off-the-shelf monocular video depth estimator (diffusion :() + RAFT-like architecture. Cost volumes strikes back! https://x.com/kwangmoo_yi/status/1899172898579108307/video/1

๐Ÿ”— ์›๋ณธ ๋งํฌ

๋ฏธ๋””์–ด

video


Auto-generated - needs manual review

Tags

domain-vision-3d domain-genai domain-visionos