AngelRaychev commited on
Commit
1751c0a
·
verified ·
1 Parent(s): 45a60ac

End of training

Browse files
Files changed (4) hide show
  1. README.md +2 -2
  2. loss_plot_policy.png +0 -0
  3. model.safetensors +1 -1
  4. training_args.bin +1 -1
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- base_model: AngelRaychev/0.5B-policy-iteration_0
3
  library_name: transformers
4
  model_name: 0.5B-policy-iteration_1
5
  tags:
@@ -11,7 +11,7 @@ licence: license
11
 
12
  # Model Card for 0.5B-policy-iteration_1
13
 
14
- This model is a fine-tuned version of [AngelRaychev/0.5B-policy-iteration_0](https://huggingface.co/AngelRaychev/0.5B-policy-iteration_0).
15
  It has been trained using [TRL](https://github.com/huggingface/trl).
16
 
17
  ## Quick start
 
1
  ---
2
+ base_model: AngelRaychev/0.5B-policy-iteration_1
3
  library_name: transformers
4
  model_name: 0.5B-policy-iteration_1
5
  tags:
 
11
 
12
  # Model Card for 0.5B-policy-iteration_1
13
 
14
+ This model is a fine-tuned version of [AngelRaychev/0.5B-policy-iteration_1](https://huggingface.co/AngelRaychev/0.5B-policy-iteration_1).
15
  It has been trained using [TRL](https://github.com/huggingface/trl).
16
 
17
  ## Quick start
loss_plot_policy.png ADDED
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9057e3e678dffede68bd65fcbe3e2740abb283a3d507bb880f3fdac6d1657fe9
3
  size 1976163472
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f53b6d54f916f0d547b7b432a9f34659d5639ed012396c0e7602812ea5b98af1
3
  size 1976163472
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3932ac3581dba35a3b48d6dd40df98b4597020cc475ff21b1fd9d433947a5286
3
  size 5688
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:60e067ff54ef830b00c15c59997283dbf984bbc8a1e55abb2d8c11711f7d530e
3
  size 5688