amd/DeepSeek-R1-Distill-Qwen-7B-awq-asym-uint4-g128-lmhead-onnx-hybrid Updated Sep 16, 2025 • 18 • 4
amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-hybrid Updated Sep 16, 2025 • 10 • 1
amd/Qwen2.5-1.5B-Instruct-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 7
amd/Qwen2.5-3B-Instruct-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 9 • 1
amd/Qwen2.5-7B-Instruct-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 10
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 9
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 6
amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Aug 27, 2025 • 10 • 2
amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Aug 27, 2025 • 5
amd/Auto-Mixed-Precision-Mixtral-8x7B-Instruct-v0.1-Weight-Activation-Mixed-MXFP4-FP8PT-KVFP8 Updated Aug 26, 2025
amd/Llama-2-70b-chat-hf-WMXFP4-AMXFP4-KVFP8-Scale-UINT8-MLPerf-GPTQ 37B • Updated Aug 5, 2025 • 7