AWQ quantization of DeepSeek-V2.5-1210

To run this model on 8x H100 80GB GPUs, serve it with vLLM:

vllm serve adamo1139/DeepSeek-V2.5-1210-AWQ --tensor-parallel-size 8 --trust-remote-code
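
Once the server is up, vLLM exposes an OpenAI-compatible API (by default on port 8000). A minimal request sketch, assuming the default host and port:

curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "adamo1139/DeepSeek-V2.5-1210-AWQ",
    "messages": [{"role": "user", "content": "Hello, who are you?"}],
    "max_tokens": 128
  }'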
Format: Safetensors
Model size: 236B params
Tensor types: I32, BF16
