Automatic Speech Recognition
Transformers
Safetensors
English
multilingual
whisper
audio
captioning
audio-captioning
speech
voice
timbre
emotion