Guilherme34/Samantha-3b-beta0.1-model (Quantized)

Description

This model is a quantized version of the original model Guilherme34/Samantha-3b-beta0.1-model.

It's quantized using the BitsAndBytes library to 4-bit using the bnb-my-repo space.

Quantization Details

  • Quantization Type: int4
  • bnb_4bit_quant_type: nf4
  • bnb_4bit_use_double_quant: True
  • bnb_4bit_compute_dtype: bfloat16
  • bnb_4bit_quant_storage: uint8

馃搫 Original Model Information

BETA MODEL, ITS NOT FINISHED

DOES NOT NEED ANY SYSTEM PROMPT, you can leave empty

Downloads last month
1
Safetensors
Model size
2B params
Tensor type
F32
BF16
U8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support

Model tree for Guilherme34/Samantha-3b-beta0.1-model-nf4

Quantized
(3)
this model