Volko
Volko76
AI & ML interests
Quantization, Fine-tune, Agentic Frameworks
Recent Activity
updated a model 3 days ago
Volko76/Qwen3.5-122B-A10B-UD-IQ4_XS-GGUF-MERGED published a model 5 days ago
Volko76/Qwen3.5-122B-A10B-UD-IQ4_XS-GGUF-MERGED new activity 7 days ago
Buck26/tts-french-dataset:InterestedOrganizations
Qwen2.5 Coder Base GGUF
A list of Qwen2.5 Coder base quantized in GGUF
GGUF Quantizations
A CPU + GPU support type of quantization. It's currently the most used quantization method. Read more here : https://github.com/ggerganov/llama.cpp
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-GGUF
Text Generation • 0.5B • Updated • 9 -
Volko76/Qwen2.5-Coder-1.5B-Instruct-GGUF
Text Generation • 2B • Updated • 19 -
Volko76/Qwen2.5-Coder-3B-Instruct-GGUF
Text Generation • 3B • Updated • 13 -
Volko76/Qwen2.5-Coder-7B-Instruct-GGUF
Text Generation • 8B • Updated • 50
Qwen2.5 Coder Instruct GGUF
A list of Qwen2.5 Coder quantized in GGUF
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-GGUF
Text Generation • 0.5B • Updated • 9 -
Volko76/Qwen2.5-Coder-1.5B-Instruct-GGUF
Text Generation • 2B • Updated • 19 -
Volko76/Qwen2.5-Coder-3B-Instruct-GGUF
Text Generation • 3B • Updated • 13 -
Volko76/Qwen2.5-Coder-7B-Instruct-GGUF
Text Generation • 8B • Updated • 50
OpenCoder GGUF
A complete open source small coding model quantized in GGUF
EXL2 Quantizations
A collection of models quantized for EXL2, one of the fastest quantisation method. https://github.com/turboderp/exllamav2
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-1.0bpw-exl2
Text Generation • Updated -
Volko76/Qwen2.5-Coder-0.5B-Instruct-2.0bpw-exl2
Text Generation • Updated -
Volko76/Qwen2.5-Coder-0.5B-Instruct-3.0bpw-exl2
Text Generation • Updated -
Volko76/Qwen2.5-Coder-0.5B-Instruct-4.5bpw-exl2
Text Generation • Updated
EXL3 Quantizations
Qwen2.5 Coder Instruct GGUF
A list of Qwen2.5 Coder quantized in GGUF
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-GGUF
Text Generation • 0.5B • Updated • 9 -
Volko76/Qwen2.5-Coder-1.5B-Instruct-GGUF
Text Generation • 2B • Updated • 19 -
Volko76/Qwen2.5-Coder-3B-Instruct-GGUF
Text Generation • 3B • Updated • 13 -
Volko76/Qwen2.5-Coder-7B-Instruct-GGUF
Text Generation • 8B • Updated • 50
Qwen2.5 Coder Base GGUF
A list of Qwen2.5 Coder base quantized in GGUF
OpenCoder GGUF
A complete open source small coding model quantized in GGUF
GGUF Quantizations
A CPU + GPU support type of quantization. It's currently the most used quantization method. Read more here : https://github.com/ggerganov/llama.cpp
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-GGUF
Text Generation • 0.5B • Updated • 9 -
Volko76/Qwen2.5-Coder-1.5B-Instruct-GGUF
Text Generation • 2B • Updated • 19 -
Volko76/Qwen2.5-Coder-3B-Instruct-GGUF
Text Generation • 3B • Updated • 13 -
Volko76/Qwen2.5-Coder-7B-Instruct-GGUF
Text Generation • 8B • Updated • 50
EXL2 Quantizations
A collection of models quantized for EXL2, one of the fastest quantisation method. https://github.com/turboderp/exllamav2
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-1.0bpw-exl2
Text Generation • Updated -
Volko76/Qwen2.5-Coder-0.5B-Instruct-2.0bpw-exl2
Text Generation • Updated -
Volko76/Qwen2.5-Coder-0.5B-Instruct-3.0bpw-exl2
Text Generation • Updated -
Volko76/Qwen2.5-Coder-0.5B-Instruct-4.5bpw-exl2
Text Generation • Updated