openslr/openslr
Updated • 600 • 28
How to use wetdog/speecht5_tts_openslr_ca with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-speech", model="wetdog/speecht5_tts_openslr_ca") # Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram
processor = AutoProcessor.from_pretrained("wetdog/speecht5_tts_openslr_ca")
model = AutoModelForTextToSpectrogram.from_pretrained("wetdog/speecht5_tts_openslr_ca")This model is a fine-tuned version of microsoft/speecht5_tts on the OpenSLR dataset. It achieves the following results on the evaluation set:
This model was trained using the instructions provided on this notebook but using the catalan subset of OpenSLR dataset. The main change is the use of trimming to delete large parts of silence that this dataset originally have. You can check the notebook used for this training here
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.5039 | 8.37 | 1000 | 0.4530 |
| 0.4723 | 16.74 | 2000 | 0.4345 |
| 0.4583 | 25.1 | 3000 | 0.4316 |
| 0.4565 | 33.47 | 4000 | 0.4294 |
| 0.4363 | 41.84 | 5000 | 0.4329 |
| 0.446 | 50.21 | 6000 | 0.4331 |
| 0.4508 | 58.58 | 7000 | 0.4336 |
| 0.4529 | 66.95 | 8000 | 0.4360 |
Base model
microsoft/speecht5_tts