Original Tweet
What if we could model vision like a wave moving through space?
Researchers from Peking & Tsinghua Universities present WaveFormer.
They treat image features as signals governed by a wave equation, explicitly controlling how low-to-high frequency details evolve across network layers.
This new Wave Propagation Operator outperforms standard Vision Transformers in image classification, detection, and segmentation, achieving up to 1.6x higher throughput with 30% fewer computations.
WaveFormer: Frequency-Time Decoupled Vision Modeling with Wave Equation
Paper: https://arxiv.org/abs/2601.08602 Code: https://github.com/ZishanShu/WaveFormer
Our report: https://mp.weixin.qq.com/s/xFoj94IIG4xjucJvew8ilQ
PapersAccepted by Jiqizhixin
Original links
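To make the "features as signals governed by a wave equation" idea concrete, here is a minimal NumPy sketch of textbook wave propagation in the frequency domain. It is not the paper's Wave Propagation Operator: it assumes the plain undamped equation u_tt = c^2 * laplacian(u) with periodic boundaries, and the function name `wave_propagate` and the parameters `c` and `t` are illustrative choices, not names from the WaveFormer code. Each spatial frequency k oscillates at its own rate omega = c * |k|, which is the sense in which low and high frequencies evolve differently over "time".

```python
import numpy as np

def wave_propagate(u0, v0, c=1.0, t=1.0):
    """Advance a 2D field by time t under u_tt = c^2 * laplacian(u).

    u0: initial field, v0: initial time derivative (same shape).
    Periodic boundaries are assumed so the FFT diagonalizes the Laplacian.
    """
    h, w = u0.shape
    # Angular spatial frequencies along each axis.
    ky = 2 * np.pi * np.fft.fftfreq(h)
    kx = 2 * np.pi * np.fft.fftfreq(w)
    omega = c * np.sqrt(ky[:, None] ** 2 + kx[None, :] ** 2)

    U0 = np.fft.fft2(u0)
    V0 = np.fft.fft2(v0)
    # sin(omega * t) / omega, written with np.sinc so omega = 0 gives t.
    sin_term = t * np.sinc(omega * t / np.pi)
    # Closed-form solution per frequency mode.
    Ut = U0 * np.cos(omega * t) + V0 * sin_term
    return np.fft.ifft2(Ut).real

# Usage: propagate a random 8x8 "feature map" starting at rest.
rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 8))
out = wave_propagate(feat, np.zeros_like(feat), c=1.0, t=0.5)
```

In this sketch `t` plays the role of depth: a larger `t` lets high-frequency modes rotate further in phase than low-frequency ones, while the mean (the k = 0 mode) is left unchanged when the field starts at rest.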
- https://arxiv.org/abs/2601.08602
- https://github.com/ZishanShu/WaveFormer
- https://mp.weixin.qq.com/s/xFoj94IIG4xjucJvew8ilQ
Media

Related
- 1n-rotary-position-embeddings-rope-are-ubiquitous-across-tra (topic: Transformer)
- 1n-rotary-position-embeddings-rope-are-ubiquitous-across-transformers-that (topic: Transformer)
- iggt-instance-grounded-geometry-transformer (topic: Transformer)
- introducing-shaper-a-method-for-robust-conditional-3d-shape-generation-from-casu (topic: Transformer)
- mamba-policy-towards-efficient-3d-diffusion-policy-with-hybrid-selective-state-m (topic: Transformer)