AdityaaXD/Multi-Agent_Reinforcement_Learning_Trading_System_Models • Reinforcement Learning • Updated 26 days ago • 222 • 4
mradermacher/Tifa-Deepsex-14b-CoT-GGUF • Reinforcement Learning • 15B • Updated Jul 31, 2025 • 373 • 21
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 • Reinforcement Learning • 8B • Updated Mar 28, 2025 • 4.3k • 194
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-GGUF • Reinforcement Learning • 8B • Updated May 5, 2025 • 35 • 4
mradermacher/Clado-BrowserOS-Action-i1-GGUF • Reinforcement Learning • 4B • Updated 7 days ago • 4.36k • 1
ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 • Reinforcement Learning • 15B • Updated Feb 13, 2025 • 1.7k • 822