Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
Haokai Zhao
jz666
Follow
AI & ML interests
None yet
Recent Activity
updated
a dataset
12 days ago
jz666/gemma2-ultrafeedback-templated-ppl-order-5
published
a dataset
12 days ago
jz666/gemma2-ultrafeedback-templated-ppl-order-5
updated
a dataset
20 days ago
jz666/gemma2-ultrafeedback-templated-ppl-margin-5
View all activity
Organizations
None yet
models
16
Sort: Recently updated
jz666/dpo-grad-acc-128-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21
•
5
jz666/dpo-grad-acc-32-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21
•
4
jz666/dpo-grad-acc-64-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21
•
6
jz666/dpo-grad-acc-16-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21
•
3
jz666/gemma-2-9b-it-dpo-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 20
•
4
jz666/gemma-2-9b-it-simpo-split-10-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 17
•
3
jz666/simpo-train-large-wrong
Text Generation
•
9B
•
Updated
Oct 16
•
8
jz666/simpo-train-filtered-full
Text Generation
•
9B
•
Updated
Oct 14
•
4
jz666/simpo-train-large-correct
Text Generation
•
9B
•
Updated
Oct 14
•
5
jz666/simpo-train-small-wrong
Text Generation
•
9B
•
Updated
Oct 14
•
3
View 16 models
datasets
15
Sort: Recently updated
jz666/gemma2-ultrafeedback-templated-ppl-order-5
Updated
12 days ago
•
32
jz666/gemma2-ultrafeedback-templated-ppl-margin-5
Viewer
•
Updated
20 days ago
•
180k
•
102
jz666/gemma2-ultrafeedback-templated-ppl
Viewer
•
Updated
21 days ago
•
61.5k
•
29
jz666/gemma2-ultrafeedback-split-length
Viewer
•
Updated
28 days ago
•
121k
•
39
jz666/gemma2-ultrafeedback-ppl-split-70
Viewer
•
Updated
Oct 18
•
483k
•
38
jz666/gemma2-ultrafeedback-ppl-split-90
Viewer
•
Updated
Oct 18
•
553k
•
29
jz666/gemma2-ultrafeedback-ppl-split-80
Viewer
•
Updated
Oct 18
•
518k
•
42
jz666/gemma2-ultrafeedback-ppl-split-60
Viewer
•
Updated
Oct 18
•
448k
•
48
jz666/gemma2-ultrafeedback-ppl-split-50
Viewer
•
Updated
Oct 18
•
411k
•
44
jz666/gemma2-ultrafeedback-ppl-split-40
Viewer
•
Updated
Oct 17
•
355k
•
45
View 15 datasets