We present an approach to automatic punctuation restoration with BERT models for English and Hungarian. For English, we conduct our experiments on TED Talks, a commonly used benchmark for punctuation restoration, while for Hungarian we evaluate our models on the Szeged Treebank dataset. Our best models achieve a macro-averaged $F_1$-score of 79.8 for English and 82.2 for Hungarian. Our code is publicly available.