RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8 Text Generation • 71B • Updated Jan 3 • 33
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w8a8 Text Generation • 8B • Updated Feb 27 • 45.3k • 2
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8 Text Generation • 71B • Updated Feb 27 • 2.4k • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w8a8 Text Generation • 15B • Updated Feb 27 • 2.85k • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8 Text Generation • 33B • Updated Feb 27 • 1.93k • 13
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8 Text Generation • 8B • Updated Feb 27 • 3.68k • 4
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w8a8 Text Generation • 2B • Updated Feb 27 • 4.04k • 2
RedHatAI/Pixtral-Large-Instruct-2411-hf-quantized.w8a8 Image-Text-to-Text • 124B • Updated Mar 31 • 32
ConfidentialMind/gte-multilingual-reranker-base-onnx-op14-opt-gpu-int8 Sentence Similarity • Updated Jul 7 • 1