Yasunori Ozaki's picture

In a Training Loop 🔄

Yasunori Ozaki PRO

alfredplpl

·

https://alfredplpl.github.io/en/index.html

AI & ML interests

Computer Vision, LLM

Recent Activity

upvoted a collection about 24 hours ago

upvoted a changelog 5 days ago

Duplicate Datasets

upvoted a paper 5 days ago

Glance: Accelerating Diffusion Models with 1 Sample

View all activity

Organizations

upvoted a collection about 24 hours ago

Z-Image

4 items • Updated 7 days ago • 74

upvoted a changelog 5 days ago

Changelog

Duplicate Datasets

5 days ago

• 64

upvoted a paper 5 days ago

Glance: Accelerating Diffusion Models with 1 Sample

Paper • 2512.02899 • Published 6 days ago • 25

upvoted a paper 15 days ago

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published 21 days ago • 64

upvoted a collection about 1 month ago

WAON

WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models • 4 items • Updated Oct 28 • 1

upvoted 2 papers about 1 month ago

WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models

Paper • 2510.22276 • Published Oct 25 • 3

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27 • 57

upvoted 2 papers about 2 months ago

UltraGen: High-Resolution Video Generation with Hierarchical Attention

Paper • 2510.18775 • Published Oct 21 • 17

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17 • 50

upvoted a collection about 2 months ago

RAE

Collection for Diffusion Transformers with Representation Autoencoders • 1 item • Updated Oct 14 • 10

upvoted 2 papers 2 months ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 95

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

Paper • 2509.24695 • Published Sep 29 • 45

upvoted a paper 3 months ago

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Paper • 2509.09676 • Published Sep 11 • 32

upvoted a collection 3 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1 • 304

upvoted an article 4 months ago

Article

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

Aug 8

•

29

upvoted 3 collections 4 months ago

DeepSeek-V3.1

4 items • Updated 11 days ago • 254

Re-LAION-5B-research

Re-LAION-5B-research • 3 items • Updated Oct 20 • 3

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 391

upvoted a paper 4 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 263

upvoted a collection 4 months ago

Skywork-UniPic

Unified Autoregressive Modeling for Visual Understanding and Generation • 2 items • Updated Aug 13 • 12