Add converted tokenizer (no trust_remote_code needed)
#20 opened 6 days ago by ArthurZ
Tokenizer JSON
👍 1 👀 1
#17 opened 3 months ago by rageltman
Asking to Release FP6 variant
#15 opened 3 months ago by MahdiFeyz
Insights on comparisons with Qwen/Qwen3-Next-80B-A3B-Instruct?
➕ 6
#14 opened 3 months ago by saireddy
Model Architecture Naming: KDA
#11 opened 3 months ago by dkleine
Trying to run this on a 4090 and 192GB RAM: not enough RAM?
3
#10 opened 3 months ago by MikaSouthworth
Tool parser?
2
#8 opened 3 months ago by prudant