End of training

Files changed (6) hide show

README.md CHANGED Viewed

@@ -3,6 +3,9 @@ library_name: transformers
 license: cc-by-nc-4.0
 base_model: facebook/mms-1b-all
 tags:
 - generated_from_trainer
 model-index:
 - name: mms-1b-all-swagen-combined-m50f50-dnn-42-0.09
@@ -14,10 +17,10 @@ should probably proofread and complete it, then remove this comment. -->
 # mms-1b-all-swagen-combined-m50f50-dnn-42-0.09
-This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2732
-- Cer: 0.0598
 ## Model description

 license: cc-by-nc-4.0
 base_model: facebook/mms-1b-all
 tags:
+- automatic-speech-recognition
+- swagen
+- mms
 - generated_from_trainer
 model-index:
 - name: mms-1b-all-swagen-combined-m50f50-dnn-42-0.09
 # mms-1b-all-swagen-combined-m50f50-dnn-42-0.09
+This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) on the SWAGEN - SWA dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2722
+- Cer: 0.0590
 ## Model description

adapter.swa.safetensors ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:bc3e42a25705947ce7d40942dd660b8a2ecffa35f690ae78ab9d6e349a496a3d
+size 8819028

all_results.json ADDED Viewed

+{
+    "epoch": 13.782608695652174,
+    "eval_cer": 0.05900823109875076,
+    "eval_loss": 0.27218547463417053,
+    "eval_runtime": 43.0028,
+    "eval_samples": 693,
+    "eval_samples_per_second": 16.115,
+    "eval_steps_per_second": 4.046,
+    "total_flos": 1.6964840334266591e+19,
+    "train_loss": 1.873124790736607,
+    "train_runtime": 2219.7426,
+    "train_samples": 2020,
+    "train_samples_per_second": 13.65,
+    "train_steps_per_second": 0.858
+}

eval_results.json ADDED Viewed

+{
+    "epoch": 13.782608695652174,
+    "eval_cer": 0.05900823109875076,
+    "eval_loss": 0.27218547463417053,
+    "eval_runtime": 43.0028,
+    "eval_samples": 693,
+    "eval_samples_per_second": 16.115,
+    "eval_steps_per_second": 4.046
+}

train_results.json ADDED Viewed

+{
+    "epoch": 13.782608695652174,
+    "total_flos": 1.6964840334266591e+19,
+    "train_loss": 1.873124790736607,
+    "train_runtime": 2219.7426,
+    "train_samples": 2020,
+    "train_samples_per_second": 13.65,
+    "train_steps_per_second": 0.858
+}

trainer_state.json ADDED Viewed

The diff for this file is too large to render. See raw diff