Quark-NPU-Workshop/Hermes-3-Llama-3.2-3B-awq-g128-int4-asym-bf16-onnx-hybrid 0.8B • Updated Oct 8 • 4