1 15 19

Zilong Chen

heheyas

heheyas

AI & ML interests

3D Vision

Recent Activity

upvoted a paper 4 days ago

UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers

upvoted a paper 13 days ago

UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers

upvoted a paper 13 days ago

VQ-VA World: Towards High-Quality Visual Question-Visual Answering

View all activity

Organizations

upvoted a paper 4 days ago

UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers

Paper • 2512.04504 • Published 5 days ago • 15

upvoted 2 papers 13 days ago

UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers

Paper • 2511.20123 • Published 14 days ago • 16

VQ-VA World: Towards High-Quality Visual Question-Visual Answering

Paper • 2511.20573 • Published 14 days ago • 7

upvoted a paper about 1 month ago

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Paper • 2510.22946 • Published Oct 27 • 16

liked a dataset 2 months ago

WINDop/OpenGPT-4o-Image

Updated Nov 2 • 2.3k • 18

liked a dataset 4 months ago

multimodal-reasoning-lab/Zebra-CoT

Viewer • Updated Jul 26 • 160k • 7.17k • 58

liked a Space 4 months ago

RISEBench Gallery

👀

A Gallery of Generation Results on RISEBench

liked a dataset 5 months ago

xcpan/MetaQuery_Instruct_2.4M

Viewer • Updated Jun 30 • 2.28M • 4.67k • 6

authored a paper 6 months ago

From Virtual Games to Real-World Play

Paper • 2506.18901 • Published Jun 23 • 10

liked 2 models 6 months ago

ByteDance-Seed/BAGEL-7B-MoT

Any-to-Any • 15B • Updated 1 day ago • 860 • 1.17k

madebyollin/taef1

Updated Aug 10, 2024 • 5.58k • 44

upvoted a paper 6 months ago

ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding

Paper • 2506.01853 • Published Jun 2 • 32

liked a dataset 7 months ago

HuggingFaceM4/the_cauldron

Viewer • Updated May 6, 2024 • 1.88M • 67.1k • 508

published a model 9 months ago

heheyas/MeshGen

Updated Oct 7, 2024 • 1

upvoted 2 papers 9 months ago

RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers

Paper • 2502.15894 • Published Feb 21 • 20

DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Paper • 2503.15265 • Published Mar 19 • 46

liked a model 9 months ago

thu-ml/Hunyuan-RIFLEx

Updated Feb 24 • 1

upvoted a collection 9 months ago

OpenX-LeRobot

Collection

Open X-Embodiment datasets in LeRobot format with standard transfomation (https://github.com/Tavish9/any4lerobot) • 34 items • Updated Aug 28 • 25

liked 2 Spaces 10 months ago

Diffusion Forcing Transformer

✨

Generate a video from any number of images

Visualize Dataset (v2.0+ latest dataset format)

💻

285

Visualize LeRobot Datasets