Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction Paper • 2510.03117 • Published Oct 3 • 11
BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain Paper • 2409.20075 • Published Sep 30, 2024 • 2
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering Paper • 2503.16867 • Published Mar 21 • 11