google/fleurs
Viewer • Updated • 768k • 57.3k • 402
How to use Scrya/whisper-medium-ms-augmented with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="Scrya/whisper-medium-ms-augmented") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("Scrya/whisper-medium-ms-augmented")
model = AutoModelForSpeechSeq2Seq.from_pretrained("Scrya/whisper-medium-ms-augmented")This model is a fine-tuned version of openai/whisper-medium on the google/fleurs dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
Training:
Evaluation:
Datasets were augmented on-the-fly using audiomentations via PitchShift and TimeStretch transformations at p=0.3.
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 0.0876 | 2.15 | 200 | 0.1949 | 10.3105 | 3.0685 |
| 0.0064 | 4.3 | 400 | 0.1974 | 9.7004 | 2.9596 |
| 0.0014 | 6.45 | 600 | 0.2031 | 9.6190 | 2.8955 |
| 0.001 | 8.6 | 800 | 0.2058 | 9.6055 | 2.8440 |
| 0.0009 | 10.75 | 1000 | 0.2066 | 9.5784 | 2.8109 |