Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
tbhot3ww
/
Llama-3.2-1B-Instruct-NVFP4
like
0
Safetensors
llama
nvfp4
quantized
vllm
hopper
dgx
8-bit precision
modelopt
License:
llama3.2
Model card
Files
Files and versions
xet
Community
main
Llama-3.2-1B-Instruct-NVFP4
1.09 GB
1 contributor
History:
3 commits
tbhot3ww
Create README.md
6f92d39
verified
11 days ago
.gitattributes
Safe
1.57 kB
NVFP4 export (modelopt_fp4) for vLLM
11 days ago
README.md
Safe
1.34 kB
Create README.md
11 days ago
chat_template.jinja
Safe
3.83 kB
NVFP4 export (modelopt_fp4) for vLLM
11 days ago
config.json
Safe
1.89 kB
NVFP4 export (modelopt_fp4) for vLLM
11 days ago
generation_config.json
Safe
184 Bytes
NVFP4 export (modelopt_fp4) for vLLM
11 days ago
hf_quant_config.json
Safe
268 Bytes
NVFP4 export (modelopt_fp4) for vLLM
11 days ago
model.safetensors
1.07 GB
xet
NVFP4 export (modelopt_fp4) for vLLM
11 days ago
special_tokens_map.json
Safe
325 Bytes
NVFP4 export (modelopt_fp4) for vLLM
11 days ago
tokenizer.json
Safe
17.2 MB
xet
NVFP4 export (modelopt_fp4) for vLLM
11 days ago
tokenizer_config.json
Safe
50.6 kB
NVFP4 export (modelopt_fp4) for vLLM
11 days ago