Back to Basics: Let Denoising Generative Models Denoise Paper • 2511.13720 • Published 21 days ago • 64
WAON Collection WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models • 4 items • Updated Oct 28 • 1
WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models Paper • 2510.22276 • Published Oct 25 • 3
UltraGen: High-Resolution Video Generation with Hierarchical Attention Paper • 2510.18775 • Published Oct 21 • 17
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper • 2510.15742 • Published Oct 17 • 50
RAE Collection Collection for Diffusion Transformers with Representation Autoencoders • 1 item • Updated Oct 14 • 10
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation Paper • 2510.02283 • Published Oct 2 • 95
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer Paper • 2509.24695 • Published Sep 29 • 45
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations Paper • 2509.09676 • Published Sep 11 • 32
Apertus LLM Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1 • 304
view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware Aug 8 • 29
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 391
Skywork-UniPic Collection Unified Autoregressive Modeling for Visual Understanding and Generation • 2 items • Updated Aug 13 • 12