---
license: cc-by-nc-4.0
language:
- de
base_model:
- SWivid/F5-TTS
pipeline_tag: text-to-speech
---

# F5 TTS German Model

This model was trained for 4.2 million steps on the German Mozilla Common Voice 19.0 recordings and an internal dataset. It is designed for text-to-speech synthesis in German. \
The model was trained with the following command:

accelerate launch --mixed_precision=bf16 finetune_cli.py --exp_name F5TTS_Base --learning_rate 1.8e-05 --batch_size_per_gpu 8000 --batch_size_type frame --max_samples 0 --grad_accumulation_steps 1 --max_grad_norm 1 --epochs 40 --num_warmup_updates 2000 --save_per_updates 100000 --last_per_steps 10000 --dataset_name german_speak --finetune --pretrain ckpts/german_speak/model_last.pt --tokenizer pinyin --log_samples --logger wandb
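
With `--batch_size_type frame`, the value of `--batch_size_per_gpu` counts mel-spectrogram frames rather than samples. As a rough sketch of what 8000 frames means in audio time, the snippet below converts frames to seconds. The sample rate (24 kHz) and hop length (256) are assumptions taken from the usual F5-TTS base configuration; this card does not state them, and `frames_to_seconds` is a hypothetical helper, not part of F5-TTS.

```python
# Assumed audio settings (typical F5-TTS base config; not stated in this card).
SAMPLE_RATE_HZ = 24_000
HOP_LENGTH = 256


def frames_to_seconds(n_frames: int,
                      sample_rate_hz: int = SAMPLE_RATE_HZ,
                      hop_length: int = HOP_LENGTH) -> float:
    """Convert a number of mel frames to seconds of audio."""
    # Each frame advances by `hop_length` samples.
    return n_frames * hop_length / sample_rate_hz


# --batch_size_per_gpu 8000 with --batch_size_type frame
batch_seconds = frames_to_seconds(8000)
print(f"{batch_seconds:.1f} s of audio per GPU batch")
```

Under these assumptions, each GPU batch holds roughly 85 seconds of audio. If your setup uses a different sample rate or hop length, pass those values explicitly.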


### Model Checkpoint
The checkpoint supports German and can be [downloaded here](https://huggingface.co/hvoss-techfak/F5-TTS-German/blob/main/model_f5tts_german.pt).
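
The checkpoint can also be fetched programmatically. The following is a minimal sketch, assuming the `huggingface_hub` package is installed; `parse_hub_blob_url` and `download_checkpoint` are hypothetical helper names introduced here for illustration, while `hf_hub_download` is the library's own download function.

```python
from urllib.parse import urlparse

CHECKPOINT_URL = (
    "https://huggingface.co/hvoss-techfak/F5-TTS-German/"
    "blob/main/model_f5tts_german.pt"
)


def parse_hub_blob_url(url: str) -> tuple[str, str, str]:
    """Split a Hub 'blob' URL into (repo_id, revision, filename)."""
    parts = urlparse(url).path.strip("/").split("/")
    # Expected layout: <owner>/<repo>/blob/<revision>/<path inside the repo>
    if len(parts) < 5 or parts[2] != "blob":
        raise ValueError(f"not a Hub blob URL: {url}")
    return "/".join(parts[:2]), parts[3], "/".join(parts[4:])


def download_checkpoint(url: str = CHECKPOINT_URL) -> str:
    """Download the file behind a blob URL and return its local cache path."""
    # Imported lazily so the URL helper works without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    repo_id, revision, filename = parse_hub_blob_url(url)
    return hf_hub_download(repo_id=repo_id, filename=filename, revision=revision)


# Example (requires network access):
# checkpoint_path = download_checkpoint()
```

The returned path points into the local Hugging Face cache, so repeated calls do not download the file again.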


### Resources
- [Tutorial video to train a new language model](https://www.youtube.com/watch?v=UO4usaOojys)

### About Us
Check out our website: [SCS Bielefeld University](https://scs.techfak.uni-bielefeld.de/)