AI & ML interests
None yet
Organizations
None yet
Models: 22
LuyiCui/slow_fast_reason-sft-s1k-1.1_full • Text Generation • 8B • Updated • 10
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-SAPO • 2B • Updated • 5
LuyiCui/sft-amc_aime-R1-Distill-Qwen-1.5B • 2B • Updated • 14
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated
LuyiCui/Qwen2.5-1.5B-Instruct-CEPO • Text Generation • 2B • Updated • 8
LuyiCui/Qwen2.5-Math-1.5B-GRPO • Updated
LuyiCui/Qwen2.5-1.5B-GRPO • Updated
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-123 • Text Generation • 2B • Updated • 8
LuyiCui/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • 2B • Updated • 15
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-3 • Updated