Running Featured 44 Distilling 100B+ Models 40x Faster with TRL 📝 44 TRL distillation for 100B+ teachers, 40x faster
bartowski/nvidia_Nemotron-Cascade-2-30B-A3B-GGUF Text Generation • 32B • Updated 24 days ago • 32.4k • 30
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated 9 days ago • 589k • 2.65k