amd/Llama-2-70b-chat-hf-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8
Safetensors
llama
quark
License: llama2
tokenizer.json (branch: main)
XuebinWang
update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2)
b7eaab4
verified
2 months ago
3.62 MB
File too large to display; check the raw version instead.