nm-testing/Meta-Llama-3-8B-Instruct-W8A8-FP8-Channelwise-compressed-tensors • Text Generation • 8B • Updated Oct 9, 2024 • 2 • 1
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a16 • Text Generation • 3B • Updated Oct 23, 2024 • 3.92k • 12
RedHatAI/whisper-large-v3-turbo-FP8-dynamic • Automatic Speech Recognition • 0.9B • Updated Apr 22 • 288 • 6