3 14 16

wybertwang PRO

wybertwang

http://ttengwang.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

upvoted a paper 21 days ago

DoPE: Denoising Rotary Position Embedding

liked a Space about 1 month ago

Gen-Verse/MMaDA

View all activity

Organizations

upvoted a paper 18 days ago

ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

Paper • 2511.14349 • Published 20 days ago • 16

upvoted a paper 21 days ago

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published 26 days ago • 92

liked a Space about 1 month ago

MMaDA

🌍

Demo for MMaDA: Multimodal Large Diffusion Language Models

authored a paper about 1 month ago

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22 • 29

upvoted 2 papers about 1 month ago

Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors

Paper • 2509.00969 • Published Aug 31 • 2

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22 • 29

liked a dataset 3 months ago

HuggingFaceM4/FineVision

Viewer • Updated Oct 21 • 24.2M • 177k • 450

published a Space 3 months ago

AudioStory

💬

AudioStory

liked a model 3 months ago

TencentARC/AudioStory-3B

Updated Sep 30 • 6 • 7

updated a model 3 months ago

TencentARC/AudioStory-3B

Updated Sep 30 • 6 • 7

published a model 3 months ago

TencentARC/AudioStory-3B

Updated Sep 30 • 6 • 7

commented a paper 3 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 21 •

authored 2 papers 3 months ago

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Paper • 2507.20939 • Published Jul 28 • 56

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 21

commented a paper 3 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 21 •

upvoted a paper 3 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 21

liked a Space 4 months ago

DepthCrafter

🦀

191

a super consistent video depth model

upvoted a paper 4 months ago

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Paper • 2507.20939 • Published Jul 28 • 56

published 2 datasets 5 months ago

wybertwang/unav_grounding

Viewer • Updated May 13 • 3.06k • 13

wybertwang/anet_val2_grounding

Viewer • Updated Jul 3 • 17k • 10

wybertwang PRO

AI & ML interests

Recent Activity

Organizations

wybertwang's activity

MMaDA

AudioStory

DepthCrafter