speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment (2021-04-03)

TL;DR

This paper introduces a new open-source speech corpus designed for pronunciation assessment, consisting of 5000 English utterances from 250 non-native speakers, half of whom are children.

Abstract

This paper introduces a new open-source speech corpus named "speechocean762", designed for pronunciation assessment and consisting of 5000 English utterances from 250 non-native speakers, half of whom are children. Five experts annotated each utterance at the sentence, word, and phoneme levels. A baseline system is released as open source to illustrate the phoneme-level pronunciation assessment workflow on this corpus. The corpus may be used freely for both commercial and non-commercial purposes. It is available for free download from OpenSLR, and the corresponding baseline system is published in the Kaldi speech recognition toolkit.

Authors

Junbo Zhang


Zhiwen Zhang


Yongqing Wang

