StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors Paper • 2512.16915 • Published 9 days ago • 37
Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation Article • 11 days ago • 36
MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives Paper • 2512.14699 • Published 11 days ago • 27
MIND-V: Hierarchical Video Generation for Long-Horizon Robotic Manipulation with RL-based Physical Alignment Paper • 2512.06628 • Published 21 days ago • 12
ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation Paper • 2512.03621 • Published 24 days ago • 8
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation Paper • 2512.07831 • Published 19 days ago • 16
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO Paper • 2511.16669 • Published Nov 20 • 31
PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning Paper • 2510.13809 • Published Oct 15 • 37
LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation Paper • 2508.07603 • Published Aug 11 • 1
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning Paper • 2510.08555 • Published Oct 9 • 63
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning Paper • 2507.13348 • Published Jul 17 • 77
LayerFlow: A Unified Model for Layer-aware Video Generation Paper • 2506.04228 • Published Jun 4 • 13