Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
johannhartmann
's Collections
Music
Computer Use Models
Document & UI Intelligence
Multimodal Models
Medical MultiModal
Computer Use Models
updated
15 days ago
Upvote
1
ByteDance-Seed/UI-TARS-72B-DPO
Image-Text-to-Text
•
73B
•
Updated
Jan 25
•
2.29k
•
147
ByteDance-Seed/UI-TARS-7B-DPO
Image-Text-to-Text
•
8B
•
Updated
Jan 25
•
1.35k
•
221
microsoft/OmniParser
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
488
•
1.7k
jadechoghari/Ferret-UI-Llama8b
Image-Text-to-Text
•
8B
•
Updated
Jan 8
•
278
•
68
microsoft/GUI-Actor-7B-Qwen2.5-VL
Image-Text-to-Text
•
8B
•
Updated
Aug 9
•
907
•
24
showlab/ShowUI-2B
Updated
Mar 11
•
2.41k
•
269
Zery/CUA_World_State_Model
Image-Text-to-Text
•
Updated
Aug 7
•
10
•
4
microsoft/Fara-7B
Image-Text-to-Text
•
8B
•
Updated
9 days ago
•
33.1k
•
432
Qwen/Qwen2.5-Omni-7B
Any-to-Any
•
11B
•
Updated
Apr 30
•
143k
•
1.83k
Hcompany/Holo2-30B-A3B
Image-Text-to-Text
•
31B
•
Updated
19 days ago
•
1.65k
•
36
Hcompany/Holo2-4B
Image-Text-to-Text
•
4B
•
Updated
26 days ago
•
2.79k
•
16
Hcompany/Holo2-8B
Image-Text-to-Text
•
9B
•
Updated
26 days ago
•
682
•
15
AskUI/PTA-1
Image-Text-to-Text
•
0.3B
•
Updated
Nov 28, 2024
•
837
•
97
OS-Copilot/OS-Atlas-Base-7B
Image-Text-to-Text
•
8B
•
Updated
Nov 19, 2024
•
970
•
42
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Feb 6
•
1.55M
•
•
1.24k
xlangai/OpenCUA-72B
Image-Text-to-Text
•
73B
•
Updated
28 days ago
•
208
•
4
xlangai/OpenCUA-32B
Image-Text-to-Text
•
33B
•
Updated
Aug 18
•
569
•
25
xlangai/OpenCUA-7B
Image-Text-to-Text
•
8B
•
Updated
27 days ago
•
26k
•
21
xlangai/Jedi-7B-1080p
Image-Text-to-Text
•
8B
•
Updated
Jun 18
•
112
•
29
xlangai/Jedi-3B-1080p
Image-Text-to-Text
•
4B
•
Updated
Jun 18
•
118
•
17
Qwen/Qwen3-VL-8B-Instruct
Image-Text-to-Text
•
9B
•
Updated
Oct 15
•
2.26M
•
•
532
Qwen/Qwen3-VL-8B-Thinking
Image-Text-to-Text
•
9B
•
Updated
14 days ago
•
228k
•
151
Upvote
1
Share collection
View history
Collection guide
Browse collections