3260 papers • 126 benchmarks • 313 datasets
Speech Recognition is the task of converting spoken language into text. It involves recognizing the words spoken in an audio recording and transcribing them into a written format. The goal is to accurately transcribe the speech in real-time or from recorded audio, taking into account factors such as accents, speaking speed, and background noise. ( Image credit: SpecAugment )
(Image credit: Open Source)
These leaderboards are used to track progress in speech-recognition-16
Use these libraries to find speech-recognition-16 models and implementations
No datasets available.
Adding a benchmark result helps the community track progress.