Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Paper • 2412.18605 • Published Dec 24, 2024 • 21
Orient Anything V2: Unifying Orientation and Rotation Understanding Paper • 2601.05573 • Published 4 days ago • 8
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion Paper • 2405.04883 • Published May 8, 2024
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces Paper • 2407.11895 • Published Jul 16, 2024 • 7
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Paper • 2408.16532 • Published Aug 29, 2024 • 50
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization Paper • 2410.12957 • Published Oct 16, 2024 • 8
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup Paper • 2410.21269 • Published Oct 28, 2024
APO: Enhancing Reasoning Ability of MLLMs via Asymmetric Policy Optimization Paper • 2506.21655 • Published Jun 26, 2025
DSI-Bench: A Benchmark for Dynamic Spatial Intelligence Paper • 2510.18873 • Published Oct 21, 2025 • 8