The design philosophy and core architecture of PaddleSpeech is described to support several essential speech- to-text and text-to-speech tasks to achieve competitive or state-of-the-art performance on various speech datasets.
PaddleSpeech is an open-source all-in-one speech toolkit. It aims at facilitating the development and research of speech processing technologies by providing an easy-to-use command-line interface and a simple code structure. This paper describes the design philosophy and core architecture of PaddleSpeech to support several essential speech-to-text and text-to-speech tasks. PaddleSpeech achieves competitive or state-of-the-art performance on various speech datasets and implements the most popular methods. It also provides recipes and pretrained models to quickly reproduce the experimental results in this paper. PaddleSpeech is publicly avaiable at https://github.com/PaddlePaddle/PaddleSpeech.
Liang Huang
3 papers
Hui Zhang
1 papers
Tian Yuan
2 papers
Yuxin Huang
1 papers
Xiaojie Chen
1 papers
Enlei Gong
1 papers
Zeyu Chen
2 papers
Xiaoguang Hu
1 papers
Dianhai Yu
1 papers
Yanjun Ma
1 papers