Home Research Papers Datasets State of the Art Pricing

Discover, visualize, and connect AI research papers. Explore the latest trends and insights in artificial intelligence research.

Product

Home
Research Papers
About

Support

Contact
Terms of Service
Privacy Policy

natural-language-processing-16

Text-To-Speech Synthesis

3260 papers • 126 benchmarks • 313 datasets

Text-To-Speech Synthesis is a machine learning task that involves converting written text into spoken words. The goal is to generate synthetic speech that sounds natural and resembles human speech as closely as possible.

(Image credit: Papersgraph)

Benchmarks

These leaderboards are used to track progress in text-to-speech-synthesis-48

Trend

Dataset

Best Model

Actions

LJSpeech

CMUDict 0.7b

20000 utterances

Libraries

Use these libraries to find text-to-speech-synthesis-48 models and implementations

PaddlePaddle/PaddleSpeech

12 papers 9,288

Datasets

No datasets available.

Subtasks

Prosody Prediction Zero-Shot Multi-Speaker TTS

Most implemented papers

Content

Introduction Benchmarks Datasets Subtasks Libraries Papers

HUI speech corpus

Thorsten voice 21.02 neutral

coqui-ai/TTS

10 papers 23,793

keonlee9420/Expressive-FastSpeech2

5 papers 244

TensorSpeech/TensorflowTTS

4 papers 3,571

CorentinJ/Real-Time-Voice-Cloning

3 papers 49,053

NVIDIA/radtts

3 papers 258

keonlee9420/STYLER

3 papers 142

tigthor/Voice-Cloning-AI

3 papers 26

dathudeptrai/TensorflowTTS

3 papers 12

PaddlePaddle/DeepSpeech

2 papers 9,278

MoonInTheRiver/DiffSinger

2 papers 3,860

Adding a benchmark result helps the community track progress.

Text-To-Speech Synthesis | State-of-the-Art