Alibaba-NLP/gme-Qwen2-VL-2B-Instruct Sentence Similarity • 2B • Updated Jun 9, 2025 • 21.5k • 129
Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand • Dec 4, 2025 • 63
Qwen/Qwen3-Next-80B-A3B-Thinking Text Generation • 81B • Updated Sep 15, 2025 • 234k • 460
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8 Text Generation • 235B • Updated Jul 30, 2025 • 33.7k • 76
Qwen/Qwen3-235B-A22B-Thinking-2507 Text Generation • 235B • Updated Aug 17, 2025 • 23.8k • 391
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8 Text Generation • 235B • Updated Sep 17, 2025 • 519k • 139
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM • Mar 12, 2025 • 480
Post 4772 Qwen 3 can launch very soon. https://github.com/ggml-org/llama.cpp/pull/12828 • 3 replies
Qwen/Qwen2.5-VL-32B-Instruct Image-Text-to-Text • 33B • Updated Apr 14, 2025 • 318k • 473