MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation Paper • 2510.18692 • Published Oct 21, 2025 • 41
Hulu-Med: A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding Paper • 2510.08668 • Published Oct 9, 2025 • 5
MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement Paper • 2509.01977 • Published Sep 2, 2025 • 13
Token Activation Map to Visually Explain Multimodal LLMs Paper • 2506.23270 • Published Jun 29, 2025 • 5
CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No Paper • 2308.12213 • Published Aug 23, 2023