PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning (2020-06-30T00:00:00.000000Z)

TL;DR

To build a high-quality open-domain chatbot, this work introduces the effective training process of PLATO-2 via curriculum learning, achieving new state-of-the-art results.

Abstract

To build a high-quality open-domain chatbot, we introduce the effective training process of PLATO-2 via curriculum learning. There are two stages involved in the learning process. In the first stage, a coarse-grained generation model is trained to learn response generation under the simplified framework of one-to-one mapping. In the second stage, a fine-grained generation model and an evaluation model are further trained to learn diverse response generation and response coherence estimation, respectively. PLATO-2 was trained on both Chinese and English data, whose effectiveness and superiority are verified through comprehensive evaluations, achieving new state-of-the-art results.

Authors

Hua Wu

12 papers

Haifeng Wang

11 papers

Xinchao Xu

2 papers

TL;DR

Abstract

Authors

References58 items

A Unified Pre-training Framework for Conversational AI

Language Models are Few-Shot Learners

Recipes for Building an Open-Domain Chatbot

Can You Put it All Together: Evaluating Conversational Agents’ Ability to Blend Skills

Sequential Latent Knowledge Selection for Knowledge-Grounded Dialogue

Towards a Human-like Open-Domain Chatbot

The Pushshift Reddit Dataset

DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation

PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

ACUTE-EVAL: Improved Dialogue Evaluation with Optimized Questions and Multi-turn Comparisons

Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection

Know More about Each Other: Evolving Dialogue Strategy via Compound Assessment

Which Tasks Should Be Learned Together in Multi-task Learning?

Unified Language Model Pre-training for Natural Language Understanding and Generation

The Second Conversational Intelligence Challenge (ConvAI2)

The Design and Implementation of XiaoIce, an Empathetic Social Chatbot

Towards Empathetic Open-domain Conversation Models: A New Benchmark and Dataset

Wizard of Wikipedia: Knowledge-Powered Conversational agents

Commonsense Knowledge Aware Conversation Generation with Graph Attention

Neural Approaches to Conversational AI

Personalizing Dialogue Agents: I have a dog, do you have pets too?

DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

Attention is All you Need

Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders

Categorical Reparameterization with Gumbel-Softmax

Deep Reinforcement Learning for Dialogue Generation

Training Deep Nets with Sublinear Memory Cost

A Diversity-Promoting Objective Function for Neural Conversation Models

Neural Machine Translation of Rare Words with Subword Units

A Neural Conversational Model

Adam: A Method for Stochastic Optimization

Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation

Curriculum learning

Human-bot chat examples by Microsoft XiaoIce and PLATO-2. References Daniel Adiwardana

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Language Models are Unsupervised Multitask Learners

Grounded Response Generation Task at DSTC7

Improving Language Understanding by Generative Pre-Training

The TREC-8 Question Answering Track Report

Modern information retrieval

Measuring nominal scale agreement among many raters.

学会了带我一起 Can you swim? Yes, but not so good at it

That is great. I would like to learn swimming too. Go for it. We can go swimming together then

contains 684M (context, response) samples and the Chinese training data contains 1.2B (context,

the embedding dimension of 1024. The 93M parameter model has 12 transformer blocks and 12 attention heads, with the embedding dimension of 768

The message contains special strings, such as r/, u/, &amp

The message contains URL

Let us find a time to go swimming together. All right. Let us have dinner together. Swimming first, then dinner

Any word has more than 30 characters or the message has more than 1024 characters

Response coherence estimation

Don't worry. Swim ring. Or I can teach you

The percentage of alphabetic characters is less than 70%

Then I am sure I can't teach you

A quick question, do you know how to insert the code of instant chat into the webpage?

Aren't you afraid I will throw you into the river? 4: Human-bot chat examples by Microsoft XiaoIce and PLATO-2. Score

How did you learn it? Is there any swimming course?

要吃啥 吃代码 What are you going to eat? , I am coding too

Field of Study

Venue Information

Name

Type

URL

要吃啥吃代码 What are you going to eat? , I am coding too