Antoine Angert
LlameUser
AI & ML interests
Large Language Models
Instruction Tuning
GRPO
Efficient Fine-Tuning (LoRA, PEFT)
Multimodal Models
Interpretability & Evaluation
AI for Scientific Research
Organizations
None yet
IEC61131-3-ST Training
- LlameUser/LeetCodeDataset-IEC61131-3-ST
  Viewer • Updated • 4.61k • 6 • 2
- LlameUser/qwen-3-4b-thinking-r1-st
  Text Generation • 196k • Updated • 8 • 1
- LlameUser/qwen-3-4b-thinking-r1-st-easy
  Text Generation • 196k • Updated • 1
- LlameUser/qwen-3-4b-thinking-r1-st-medium
  Text Generation • 196k • Updated • 1
GRPO-Countdown-Problem
models 14
LlameUser/qwen-3-4b-instruct-r1-st
Text Generation • 196k • Updated • 3
LlameUser/qwen-3-4b-thinking-r1-st-hard
Text Generation • 196k • Updated • 2
LlameUser/qwen-3-4b-thinking-r1-st-medium
Text Generation • 196k • Updated • 1
LlameUser/qwen-3-4b-thinking-r1-st-easy
Text Generation • 196k • Updated • 1
LlameUser/qwen-3-4b-thinking-r1-st
Text Generation • 196k • Updated • 8 • 1
LlameUser/qwen-3-4b-thinking-r1-countdown
Text Generation • 196k • Updated • 1
LlameUser/qwen-3-1.7b-r1-countdown
Text Generation • 2B • Updated • 2
LlameUser/Qwen2.5-3B-Open-R1-GRPO
Text Generation • 3B • Updated • 2
LlameUser/Qwen2.5-1.5B-Open-R1-GRPO
Updated
LlameUser/qwen-3-4b-instruct-r1-countdown
Text Generation • 196k • Updated • 4