Dmytro Mishkin 🇺🇦 (@ducha_aiki)

2025-02-24 | ❤️ 112 | 🔁 19


ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval

@zgq1879, Yuanpei Liu, @hankaixyz, @WeidiXie, Andrew Zisserman

tl;dr: condition the visual encoder on the text query for better retrieval. https://arxiv.org/abs/2502.15682 https://x.com/ducha_aiki/status/1893976365805752790/photo/1

🔗 Original link

Media

[4 photos]



Tags

domain-dev-tools