AWQ or GPTQ (INT4) Quantization Planned?

#2
by alonsoko - opened

Are you planning on releasing an AWQ or GPTQ INT4 quantizations of the model?
So I can serve it on VLLM.

Sign up or log in to comment