Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context Paper • 2603.15653 • Published 12 days ago • 4
SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation Paper • 2603.14152 • Published 4 days ago • 6
SegviGen: Repurposing 3D Generative Model for Part Segmentation Paper • 2603.16869 • Published 1 day ago • 16
GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering Paper • 2603.15616 • Published 2 days ago • 4
Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods Paper • 2603.15026 • Published 3 days ago • 8
ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer Paper • 2603.15478 • Published 3 days ago • 23
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning Paper • 2603.12257 • Published 6 days ago • 31
WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing Paper • 2603.11593 • Published 7 days ago • 24
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers Paper • 2603.12245 • Published 6 days ago • 17
LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation Paper • 2603.10899 • Published 8 days ago • 6