3 25 44

Ayman Khan

AymanKing

amugoodbad229

AI & ML interests

None yet

Recent Activity

liked a Space 4 days ago

Tongyi-MAI/Z-Image-Turbo

liked a Space 4 days ago

webml-community/Supertonic-TTS-WebGPU

upvoted an article 4 days ago

Transformers v5: Simple model definitions powering the AI ecosystem

View all activity

Organizations

None yet

upvoted an article 4 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

8 days ago

•

227

upvoted a paper 5 days ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published 20 days ago • 222

upvoted an article 24 days ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

•

1.15k

upvoted an article 28 days ago

Article

Streaming datasets: 100x More Efficient

Oct 27

•

upvoted a paper about 2 months ago

Quantum-PEFT: Ultra parameter-efficient fine-tuning

Paper • 2503.05431 • Published Mar 7 • 1

upvoted 2 papers 2 months ago

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11 • 239

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6 • 116

upvoted 3 papers 4 months ago

Ovis2.5 Technical Report

Paper • 2508.11737 • Published Aug 15 • 111

DINOv3

Paper • 2508.10104 • Published Aug 13 • 285

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published Aug 6 • 58

upvoted 4 papers 5 months ago

MOSPA: Human Motion Generation Driven by Spatial Audio

Paper • 2507.11949 • Published Jul 16 • 24

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

Paper • 2507.13344 • Published Jul 17 • 57

EmbRACE-3K: Embodied Reasoning and Action in Complex Environments

Paper • 2507.10548 • Published Jul 14 • 36

FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers

Paper • 2507.12956 • Published Jul 17 • 24

upvoted an article 5 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

722

upvoted a paper 8 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 303

upvoted a collection 8 months ago

Wan2.1 14B 480p I2V LoRAs

Collection

A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 49 items • Updated May 24 • 208

upvoted 3 papers 8 months ago

EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer

Paper • 2503.07027 • Published Mar 10 • 29

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published Mar 16 • 68

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 117

Ayman Khan

AI & ML interests

Recent Activity

Organizations

AymanKing's activity

Transformers v5: Simple model definitions powering the AI ecosystem

Introducing smolagents: simple agents that write actions in code.

Streaming datasets: 100x More Efficient

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders