---
license: mit
base_model:
- zai-org/GLM-4.5-Air
tags:
- fp8
- quantized
- quark
- fp8_e4m3
base_model_relation: quantized
---

This is GLM-4.5 Air quantized to fp8 (fp8_e4m3) with AMD Quark. It was quantized on and for GFX1100 cards, in this case 2x W7900.

This is my first quantized model and I'm still evaluating it. It was calibrated with wikitext; if this works out, a future iteration will be calibrated on other datasets.

Quantized perplexity on wikitext: 4.96421480178833
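For readers unfamiliar with the metric, perplexity is the exponential of the mean per-token negative log-likelihood. The sketch below shows only that arithmetic. The helper name `perplexity` and the sample values are hypothetical, and it is not the evaluation script used for this model: the context length, stride, and wikitext split behind the figure above are not stated here.

```python
import math


def perplexity(token_nlls):
    """Return exp(mean NLL), where each NLL is in nats for one token."""
    if not token_nlls:
        raise ValueError("need at least one token NLL")
    return math.exp(sum(token_nlls) / len(token_nlls))


# Hypothetical per-token negative log-likelihoods, NOT measured on this model.
example_nlls = [1.2, 1.9, 1.5, 1.8]
print(perplexity(example_nlls))
```

Read in the other direction, a perplexity of 4.96421480178833 corresponds to a mean negative log-likelihood of about 1.60 nats per token (`math.log(4.96421480178833)`).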