metadata
license: apache-2.0
base_model:
- Qwen/Qwen3-VL-2B-Instruct
tags:
- autoround
- nvfp4
This is Qwen/Qwen3-VL-8B-Instruct quantized with AutoRound in W4A16 (GPTQ format). The model has been created, tested, and evaluated by The Kaitchup. The model is NOT compatible with vLLM (as of v0.11).
- Developed by: The Kaitchup
- License: Apache 2.0 license
How to Support My Work
Subscribe to The Kaitchup. This helps me a lot to continue quantizing and evaluating models for free. Or you prefer to give some GPU hours, "buy me a coffee"