Home Research Papers Datasets State of the Art Pricing

Discover, visualize, and connect AI research papers. Explore the latest trends and insights in artificial intelligence research.

Product

Home
Research Papers
About

Support

Contact
Terms of Service
Privacy Policy

© 2026 Papersgraph. All rights reserved.

speech-16

Speech Recognition

3260 papers • 126 benchmarks • 313 datasets

Speech Recognition is the task of converting spoken language into text. It involves recognizing the words spoken in an audio recording and transcribing them into a written format. The goal is to accurately transcribe the speech in real-time or from recorded audio, taking into account factors such as accents, speaking speed, and background noise. ( Image credit: SpecAugment )

(Image credit: Open Source)

Speech Recognition

Benchmarks

These leaderboards are used to track progress in speech-recognition-16

Trend

Dataset

Best Model

Actions

LibriSpeech test-clean

LibriSpeech test-clean

LibriSpeech test-other

LibriSpeech test-other

TIMIT

Libraries

i

Use these libraries to find speech-recognition-16 models and implementations

msalhab96/SpeeQ

13 papers 29

Datasets

No datasets available.

Subtasks

Automatic Speech Recognition (ASR)Visual Speech Recognition Robust Speech Recognition Distant Speech Recognition Distant Speech Recognition

Most implemented papers

Content

Introduction Benchmarks Datasets Subtasks Libraries Papers

TIMIT

Switchboard + Hub500

Switchboard + Hub500

Common Voice German

Common Voice German

WSJ eval92

WSJ eval92

MediaSpeech

MediaSpeech

TUDA

TUDA

SLUE

SLUE

swb_hub_500 WER fullSWBCH

swb_hub_500 WER fullSWBCH

AISHELL-1

AISHELL-1

Common Voice Spanish

Common Voice Spanish

Common Voice French

Common Voice French

WenetSpeech

WenetSpeech

Hub5'00 SwitchBoard

Hub5'00 SwitchBoard

Libri-Light test-clean

Libri-Light test-clean

Libri-Light test-other

Libri-Light test-other

EasyCom

EasyCom

WSJ dev93

WSJ dev93

WSJ eval93

WSJ eval93

Fongbe audio

Fongbe audio

CHiME-6 dev_gss12

CHiME-6 dev_gss12

Common Voice

Common Voice

VIVOS

VIVOS

Common Voice vi

Common Voice vi

AMI SDM1

AMI SDM1

Tedlium

Tedlium

Europarl-ASR EN Guest-test

Europarl-ASR EN Guest-test

Europarl-ASR EN MEP-test

Europarl-ASR EN MEP-test

Speech Commands

Speech Commands

LRS3-TED

LRS3-TED

Switchboard (300hr)

Switchboard (300hr)

Hub5'00 CallHome

Hub5'00 CallHome

Hub5'00 FISHER-SWBD

Hub5'00 FISHER-SWBD

LibriSpeech train-clean-100 test-clean

LibriSpeech train-clean-100 test-clean

LibriSpeech train-clean-100 test-other

LibriSpeech train-clean-100 test-other

Common Voice Portuguese

Common Voice Portuguese

Common Voice Italian

Common Voice Italian

SPGISpeech

SPGISpeech

GigaSpeech

GigaSpeech

GigaSpeech DEV

GigaSpeech DEV

GigaSpeech TEST

GigaSpeech TEST

AMI IMH

AMI IMH

Switchboard SWBD

Switchboard SWBD

Switchboard CallHome

Switchboard CallHome

CHiME-6 eval

CHiME-6 eval

Google Speech Commands - Musan

Google Speech Commands - Musan

Vox Populi

Vox Populi

Artie Bias Corpus

Artie Bias Corpus

Fleurs (English)

Fleurs (English)

CHiME6

CHiME6

WSJ

WSJ

AMI-IHM

AMI-IHM

CALLHOME

CALLHOME

Switchboard corpus

Switchboard corpus

CORAAL

CORAAL

LRS2

LRS2

LibriCSS

LibriCSS

12 papers 6,221

PaddlePaddle/PaddleSpeech

11 papers 6,410

pytorch/fairseq

10 papers 21,285

mravanelli/pytorch-kaldi

8 papers 2,276

huggingface/transformers

7 papers 85,805

TensorSpeech/TensorFlowASR

6 papers 816

6 papers 277

facebookresearch/fairseq

5 papers 21,269

Alexander-H-Liu/End-to-end-ASR-Pyto…

5 papers 1,082

4 papers 5,913

rwth-i6/returnn

4 papers 335

microsoft/speecht5

3 papers 401

microsoft/unilm

2 papers 11,172

2 papers 2,775

alibaba-damo-academy/FunASR

2 papers 270

Sequence-To-Sequence Speech Recognition

Target Speaker Extraction

Accented Speech Recognition

Noisy Speech Recognition

English Conversational Speech Recognition

Adding a benchmark result helps the community track progress.

Speech Recognition | State-of-the-Art