Are you planning to release AWQ or GPTQ INT4 quantizations of the model? I'd like to serve it with vLLM.