--- license: mit base_model: MiniMaxAI/MiniMax-M2 base_model_relation: quantized quantized_by: turboderp tags: - exl3 --- EXL3 quants of [MiniMax-M2](https://huggingface.co/MiniMaxAI/MiniMax-M2) ⚠️ Requires ExLlamaV3 v0.0.12 (or v0.0.11 `dev` branch) Base bitrates: [2.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.0bpw) [3.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.0bpw) [4.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/4.0bpw) Optimized: [2.04 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.04bpw) [2.27 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.27bpw) [3.04 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.04bpw) [3.50 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.5bpw) [4.03 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/4.03bpw) . | KL-div | ppl | HumanEval@1 ---------|--------|-------|------------- 2.00 bpw | 0.400 | 10.92 | 80.5% 2.04 bpw | 0.297 | 10.23 | 87.1% 2.27 bpw | 0.252 | 9.78 | 88.4% 3.00 bpw | 0.141 | 8.99 | 87.8% 3.04 bpw | 0.117 | 8.73 | 87.2% 3.50 bpw | 0.094 | 8.78 | 88.4% 4.00 bpw | 0.087 | 8.58 | 89.6% 4.03 bpw | 0.077 | 8.61 | 87.8% original | - | 8.51 | 87.2%¹ ¹ Unconfirmed
2.00 bpw
2.00 bpw
2.04 bpw
2.04 bpw
2.27 bpw
2.27 bpw
3.00 bpw
3.00 bpw
3.04 bpw
3.04 bpw
3.50 bpw
3.50 bpw
4.00 bpw
4.00 bpw
4.00 bpw
4.03 bpw
API
API