MiroThinker-v0.2 Collection Better performance in multi-hop search and multilingual tasks. • 8 items • Updated Nov 9 • 7
My favorite model list (UPDATED) Collection Here I have collected models that I have enjoyed and that I actively use for RP or creativity • 10 items • Updated Aug 2 • 10
✨SimpleChat Collection The SimpleChat series represents our new exploration into Non-Chain-of-Thought (Non-CoT) models. Designed to be concise, rational, and empathetic. • 5 items • Updated Sep 3 • 3
Apertus LLM Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1 • 308
view article Article From Zero to AI: Build Your First Language Model in 5 Minutes with Google's Gemma Jul 31 • 5
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale Paper • 2505.03005 • Published May 5 • 36
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25 • 65
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Jul 21 • 125
AI4Privacy_v2 Collection Collection for AI4Privacy Version 2 trained on PII200k • 6 items • Updated Sep 25, 2024 • 4