UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Paper • 2603.24533 • Published about 14 hours ago • 4
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Paper • 2603.24440 • Published about 16 hours ago • 2
Toward Physically Consistent Driving Video World Models under Challenging Trajectories Paper • 2603.24506 • Published about 15 hours ago • 2
OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning Paper • 2603.24458 • Published about 15 hours ago • 1
GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents Paper • 2603.24329 • Published about 17 hours ago • 12
RealMaster: Lifting Rendered Scenes into Photorealistic Video Paper • 2603.23462 • Published 1 day ago • 23
MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding Paper • 2603.22458 • Published 3 days ago • 113
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs Paper • 2603.22446 • Published 3 days ago • 4
ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment Paper • 2603.23376 • Published 1 day ago • 2
RealMaster: Lifting Rendered Scenes into Photorealistic Video Paper • 2603.23462 • Published 1 day ago • 23
WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG Paper • 2603.23497 • Published 1 day ago • 71
PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost Paper • 2603.21383 • Published 3 days ago • 14
Effective Strategies for Asynchronous Software Engineering Agents Paper • 2603.21489 • Published 3 days ago • 5
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation Paper • 2603.22117 • Published 3 days ago • 22
WorldCache: Content-Aware Caching for Accelerated Video World Models Paper • 2603.22286 • Published 3 days ago • 4
Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD Paper • 2603.20155 • Published 6 days ago • 7
WorldAgents: Can Foundation Image Models be Agents for 3D World Models? Paper • 2603.19708 • Published 6 days ago • 11