·
AI & ML interests
None yet
Organizations
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata
Text Generation
• 8B • Updated
• 2
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_longer
Text Generation
• 8B • Updated
• 1
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr
Text Generation
• 8B • Updated
• 1
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Text Generation
• 8B • Updated
• 5
Dongwei/Qwen-2.5-7B_Base_Math_smalllr
Text Generation
• 8B • Updated
• 4
• 6
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr
Text Generation
• 8B • Updated
• 2
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr
Text Generation
• 2B • Updated
• 2
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr
Text Generation
• 2B • Updated
Dongwei/Qwen-2.5-7B_Math_smalllr
Text Generation
• 8B • Updated
• 2
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math
Text Generation
• 8B • Updated
• 7
Text Generation
• 8B • Updated
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math
Text Generation
• 2B • Updated
• 2
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math
Text Generation
• 2B • Updated
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO
Text Generation
• 8B • Updated
• 10
• 1
Text Generation
• 8B • Updated
• 1
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated
• 2
• 1
Dongwei/Rationalyst_reasoning_datasets
Text Generation
• 8B • Updated
• 52
• 4