minpeter/calculator-sft
Viewer • Updated • 1.5k • 8 • 1
Calculator tool-use model fine-tuned from Qwen3-0.6B base.
| Parameter | Value |
|---|---|
| Learning Rate | 1e-5 |
| Batch Size | 32 |
| Micro Batch Size | 8 |
| Sequence Length | 2048 |
| Steps | 50 |
| Optimizer | AdamW |
| Metric | Score |
|---|---|
| Accuracy | 99.2% |
| Avg Turns | 2.0 |
Evaluated on 158 examples with 3 rollouts each using verifiers.
Uses standard ChatML format without thinking tokens.