Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
naiweizi
/
dpo-harmless_helpful-rc_armo
like
0
Model card
Files
Files and versions
xet
Community
main
dpo-harmless_helpful-rc_armo
/
final_checkpoint
/
README.md
naiweizi
Initial upload
93a0924
verified
7 months ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
Safe
74 Bytes
Training procedure
Framework versions
PEFT 0.5.0
PEFT 0.5.0