videotransfusion

classroom

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

xhan77 authored a paper 28 days ago

LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation

xhan77 authored a paper 28 days ago

MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation

xhan77 authored a paper 28 days ago

TV2TV: A Unified Framework for Interleaved Language and Video Generation

View all activity

xhan77

authored 3 papers 28 days ago

LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation

Paper • 2412.15188 • Published Dec 19, 2024 • 1

MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation

Paper • 2506.07999 • Published Jun 9, 2025 • 2

TV2TV: A Unified Framework for Interleaved Language and Video Generation

Paper • 2512.05103 • Published Dec 4, 2025 • 18

artificially-intelligent

updated a model 8 months ago

videotransfusion/sana_t2i_freezetext

Updated May 11, 2025

artificially-intelligent

published a model 8 months ago

videotransfusion/sana_t2i_freezetext

Updated May 11, 2025

artificially-intelligent

updated a model 8 months ago

videotransfusion/vivek

Updated May 11, 2025

artificially-intelligent

published a model 8 months ago

videotransfusion/vivek

Updated May 11, 2025

artificially-intelligent

updated a model 8 months ago

videotransfusion/i2t-v2

Updated May 8, 2025

artificially-intelligent

published a model 8 months ago

videotransfusion/i2t-v2

Updated May 8, 2025

jsingh

authored 2 papers 9 months ago

Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors

Paper • 2409.02979 • Published Sep 4, 2024 • 1

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Paper • 2504.10483 • Published Apr 14, 2025 • 21

jsingh

updated a model 9 months ago

videotransfusion/sit_pretrained_model-llama3-freezetext-bz512-lr1en4-v1

Updated Apr 25, 2025

jsingh

published a model 9 months ago

videotransfusion/sit_pretrained_model-llama3-freezetext-bz512-lr1en4-v1

Updated Apr 25, 2025

jsingh

updated a model 9 months ago

videotransfusion/sit_pretrained_model-llama3-freezetext-bz512-lr1en4-detailedcaption-init50k-v2

Updated Apr 25, 2025

jsingh

published a model 9 months ago

videotransfusion/sit_pretrained_model-llama3-freezetext-bz512-lr1en4-detailedcaption-init50k-v2

Updated Apr 25, 2025

swj0419

authored a paper 11 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 124

swj0419

authored a paper about 1 year ago

Negative Token Merging: Image-based Adversarial Feature Guidance

Paper • 2412.01339 • Published Dec 2, 2024 • 22

jsingh

authored a paper about 1 year ago

Negative Token Merging: Image-based Adversarial Feature Guidance

Paper • 2412.01339 • Published Dec 2, 2024 • 22

swj0419

authored a paper over 1 year ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 78

xhan77

authored a paper over 1 year ago

Can Language Models Solve Graph Problems in Natural Language?

Paper • 2305.10037 • Published May 17, 2023 • 1

AI & ML interests

Recent Activity

Team members 4

videotransfusion's activity