Running on Zero 8 Qwen3-VL Multimodal Search Engine 🔥 8 Cross-modal text-image search powered by Qwen3-VL
huihui-ai/Huihui-Qwen3-VL-8B-Instruct-abliterated Image-Text-to-Text • 9B • Updated Dec 15, 2025 • 5.55k • 139
fancyfeast/llama-joycaption-beta-one-hf-llava Image-Text-to-Text • 8B • Updated May 16, 2025 • 67.5k • 298
huihui-ai/Huihui-MiniCPM-V-4_5-abliterated Image-Text-to-Text • 9B • Updated Sep 8, 2025 • 6.17k • 27
Running on Zero Featured 880 Joy Caption Beta One 🖼 880 Generate captions for images with various styles and options
HuggingFaceTB/SmolVLM2-500M-Video-Instruct Image-Text-to-Text • 0.5B • Updated Apr 8, 2025 • 84.8k • 113
Runtime error Featured 198 Better Florence 2 🔥 198 Analyze images to detect objects, generate captions, or perform OCR