Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
cyberpole
/
gemma-2-2b-it-reward
like
0
Follow
Cyber-pôle des révolutionnaires
4
PEFT
TensorBoard
Safetensors
trl
reward-trainer
Generated from Trainer
License:
gemma
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Use this model
77a6b57
gemma-2-2b-it-reward
/
last-checkpoint
102 MB
1 contributor
History:
82 commits
k-r-l
Training in progress, step 39, checkpoint
8a6f56a
verified
about 1 year ago
README.md
Safe
5.09 kB
Training in progress, step 1, checkpoint
about 1 year ago
adapter_config.json
751 Bytes
Training in progress, step 1, checkpoint
about 1 year ago
adapter_model.safetensors
41.6 MB
xet
Training in progress, step 39, checkpoint
about 1 year ago
optimizer.pt
21.5 MB
xet
Training in progress, step 39, checkpoint
about 1 year ago
rng_state.pth
14.2 kB
xet
Training in progress, step 39, checkpoint
about 1 year ago
scheduler.pt
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.06 kB
xet
Training in progress, step 39, checkpoint
about 1 year ago
special_tokens_map.json
Safe
636 Bytes
Training in progress, step 1, checkpoint
about 1 year ago
tokenizer.json
Safe
34.4 MB
xet
Training in progress, step 1, checkpoint
about 1 year ago
tokenizer.model
Safe
4.24 MB
xet
Training in progress, step 1, checkpoint
about 1 year ago
tokenizer_config.json
Safe
47 kB
Training in progress, step 1, checkpoint
about 1 year ago
trainer_state.json
7.42 kB
Training in progress, step 39, checkpoint
about 1 year ago
training_args.bin
5.37 kB
xet
Training in progress, step 1, checkpoint
about 1 year ago