ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 • Reinforcement Learning • 8B • Updated Mar 28, 2025 • 17.6k • 200
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4 • Reinforcement Learning • 8B • Updated Mar 26, 2025 • 720 • 224
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B • Reinforcement Learning • 2B • Updated Apr 6, 2025 • 67 • 1