AWQ quantization of DeepSeek-V2.5-1210

To run this model on 8x H100 80GB GPUs, serve it with vLLM:

vllm serve adamo1139/DeepSeek-V2.5-1210-AWQ --tensor-parallel-size 8 --trust-remote-code
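
Once the server is up, vLLM exposes an OpenAI-compatible API (by default on port 8000). A minimal request sketch, assuming the default host and port:

curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "adamo1139/DeepSeek-V2.5-1210-AWQ",
    "messages": [{"role": "user", "content": "Hello, who are you?"}],
    "max_tokens": 128
  }'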
Format: Safetensors
Model size: 236B params
Tensor types: I32, BF16
