view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 389
PresentAgent: Multimodal Agent for Presentation Video Generation Paper • 2507.04036 • Published Jul 5, 2025 • 10
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing Paper • 2506.03197 • Published Jun 1, 2025 • 4
Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion Paper • 2405.11286 • Published May 18, 2024 • 1
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning Paper • 2412.03248 • Published Dec 4, 2024 • 26