Haitao999/Qwen2.5-NR-14B-random-natural_reasoning_simple • Text Generation • 15B • Updated Jun 4, 2025 • 2
Haitao999/Qwen2.5-NR-7B-random-natural_reasoning_simple • Text Generation • 8B • Updated Jun 3, 2025 • 3
Haitao999/Llama-3.1-8B-Instruct-EMPO-numia_prompt_dpo1 • Text Generation • 8B • Updated May 16, 2025 • 2
Haitao999/Llama-3.2-3B-Instruct-EMPO-numia_prompt_dpo1 • Text Generation • 3B • Updated May 14, 2025 • 6
Haitao999/Llama-3.2-3B-Instruct-GRPO-numia_prompt_dpo1 • Text Generation • 3B • Updated May 14, 2025 • 2
Haitao999/Qwen2.5-7B-Base-EMPO-natural_reasoning_all_level • Text Generation • 8B • Updated Apr 27, 2025 • 63
Haitao999/Qwen2.5-7B-Instruct-EMPO-natural_reasoning_simple-0419 • Text Generation • 8B • Updated Apr 19, 2025 • 7
Haitao999/Qwen2.5-7B-Instruct-EMPO-natural_reasoning_simple_from_base_general-verifier • Text Generation • 8B • Updated Apr 18, 2025 • 2