Cross-Lingual Natural Language Generation via Pre-Training (2019-09-23)

TL;DR

Experimental results on question generation and abstractive summarization show that the model outperforms the machine-translation-based pipeline methods for zero-shot cross-lingual generation and improves NLG performance of low-resource languages by leveraging rich-resource language data.

Abstract

In this work we focus on transferring supervision signals of natural language generation (NLG) tasks between multiple languages. We propose to pre-train the encoder and the decoder of a sequence-to-sequence model under both monolingual and cross-lingual settings. The pre-training objective encourages the model to represent different languages in a shared space, so that we can conduct zero-shot cross-lingual transfer. After pre-training, we fine-tune the model on downstream NLG tasks using monolingual data. The sequence-to-sequence model trained in a single language can then be directly evaluated beyond that language (i.e., accepting multilingual input and producing multilingual output). Experimental results on question generation and abstractive summarization show that our model outperforms the machine-translation-based pipeline methods for zero-shot cross-lingual generation. Moreover, cross-lingual transfer improves NLG performance of low-resource languages by leveraging rich-resource language data. Our implementation and data are publicly available.
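To make the comparison in the abstract concrete, the following is a minimal sketch of the two evaluation setups: a machine-translation-based pipeline and direct zero-shot generation. It assumes the pipeline has the usual translate-in, generate-in-English, translate-out shape, which the abstract does not spell out. All function names, signatures, and the toy stand-ins are hypothetical and are not taken from the released implementation.

```python
from typing import Callable

# Hypothetical signatures, for illustration only.
Translate = Callable[[str, str, str], str]  # (text, src_lang, tgt_lang) -> text
GenerateEn = Callable[[str], str]           # English input -> English output
Generate = Callable[[str, str], str]        # (text, lang) -> output in that language


def pipeline_generate(text: str, lang: str,
                      translate: Translate, generate_en: GenerateEn) -> str:
    """MT-based pipeline (assumed form): translate in, generate in English, translate out."""
    english_input = translate(text, lang, "en")
    english_output = generate_en(english_input)
    return translate(english_output, "en", lang)


def zero_shot_generate(text: str, lang: str, generate: Generate) -> str:
    """Direct zero-shot use: a model fine-tuned in one language is applied
    to input in another language without any translation step."""
    return generate(text, lang)


# Toy stand-ins so the sketch runs; a real system would call an MT system
# and a fine-tuned sequence-to-sequence model here.
def toy_translate(text: str, src: str, tgt: str) -> str:
    return f"[{src}->{tgt}] {text}"


def toy_generate_en(text: str) -> str:
    return f"Q({text})"


def toy_generate(text: str, lang: str) -> str:
    return f"Q_{lang}({text})"


if __name__ == "__main__":
    print(pipeline_generate("bonjour", "fr", toy_translate, toy_generate_en))
    print(zero_shot_generate("bonjour", "fr", toy_generate))
```

The pipeline calls the translation system twice per example, so translation errors can compound on both the input and the output side. The zero-shot path relies entirely on the shared cross-lingual representation learned during pre-training.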

Authors

Li Dong

Zewen Chi

Heyan Huang
