TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper • 2512.05150 • Published 7 days ago • 57
Mobile User Interface Element Detection Via Adaptively Prompt Tuning Paper • 2305.09699 • Published May 16, 2023
DiffusionInst: Diffusion Model for Instance Segmentation Paper • 2212.02773 • Published Dec 6, 2022
InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation Paper • 2504.10905 • Published Apr 15
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding Paper • 2510.06308 • Published Oct 7 • 53
MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks Paper • 2509.14638 • Published Sep 18 • 11
GroveMoE Collection GroveMoE is an open-source family of large language models developed by the AGI Center, Ant Research Institute. • 4 items • Updated Oct 13 • 7
GroveMoE Collection GroveMoE is an open-source family of large language models developed by the AGI Center, Ant Research Institute. • 4 items • Updated Oct 13 • 7
GroveMoE Collection GroveMoE is an open-source family of large language models developed by the AGI Center, Ant Research Institute. • 4 items • Updated Oct 13 • 7
Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training Paper • 2408.17081 • Published Aug 30, 2024
Towards Explainable Fake Image Detection with Multi-Modal Large Language Models Paper • 2504.14245 • Published Apr 19