Guilherme34/Samantha-3b-beta0.1-model (Quantized)
Description
This model is a quantized version of the original model Guilherme34/Samantha-3b-beta0.1-model.
It's quantized using the BitsAndBytes library to 4-bit using the bnb-my-repo space.
Quantization Details
- Quantization Type: int4
- bnb_4bit_quant_type: nf4
- bnb_4bit_use_double_quant: True
- bnb_4bit_compute_dtype: bfloat16
- bnb_4bit_quant_storage: uint8
馃搫 Original Model Information
BETA MODEL, ITS NOT FINISHED
DOES NOT NEED ANY SYSTEM PROMPT, you can leave empty
- Downloads last month
- 1
Model tree for Guilherme34/Samantha-3b-beta0.1-model-nf4
Base model
meta-llama/Llama-3.2-3B-Instruct
Quantized
Guilherme34/Samantha-3b-beta0.1-model