AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration Paper • 2510.10395 • Published Oct 12 • 30
VLKEB: A Large Vision-Language Model Knowledge Editing Benchmark Paper • 2403.07350 • Published Mar 12, 2024 • 1