UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving Paper • 2604.02190 • Published 13 days ago • 27
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published Dec 15, 2025 • 106
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published Dec 18, 2025 • 89
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models Paper • 2512.08829 • Published Dec 9, 2025 • 21
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published Apr 29, 2025 • 44
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding Paper • 2503.10596 • Published Mar 13, 2025 • 18
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models Paper • 2503.08686 • Published Mar 11, 2025 • 19
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning Paper • 2503.07608 • Published Mar 10, 2025 • 23
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published Feb 18, 2025 • 38
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper • 2502.13145 • Published Feb 18, 2025 • 38