Composed Image Retrieval for Training-Free Domain Conversion Paper • 2412.03297 • Published Dec 4, 2024 • 1
REGLUE Your Latents with Global and Local Semantics for Entangled Diffusion Paper • 2512.16636 • Published Dec 18, 2025 • 26
Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation? Paper • 2602.23339 • Published 3 days ago • 5
Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation? Paper • 2602.23339 • Published 3 days ago • 5
Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation? Paper • 2602.23339 • Published 3 days ago • 5
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation Paper • 2503.19777 • Published Mar 25, 2025 • 1
Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders Paper • 2509.00177 • Published Aug 29, 2025
Processing and acquisition traces in visual encoders: What does CLIP know about your camera? Paper • 2508.10637 • Published Aug 14, 2025 • 8
Attention, Please! Revisiting Attentive Probing for Masked Image Modeling Paper • 2506.10178 • Published Jun 11, 2025 • 7
ProbPose: A Probabilistic Approach to 2D Human Pose Estimation Paper • 2412.02254 • Published Dec 3, 2024