GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection (2021-11-29T00:00:00.000000Z)

TL;DR

GALAXY is a novel pre-trained dialog model that explicitly learns dialog policy from limited labeled dialogs and large-scale unlabeled dialog corpora via semi-supervised learning and has a stronger few-shot ability than existing models under various low-resource settings.

Abstract

Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, current pre-training methods mainly focus on enhancing dialog understanding and generation tasks while neglecting the exploitation of dialog policy. In this paper, we propose GALAXY, a novel pre-trained dialog model that explicitly learns dialog policy from limited labeled dialogs and large-scale unlabeled dialog corpora via semi-supervised learning. Specifically, we introduce a dialog act prediction task for policy optimization during pre-training and employ a consistency regularization term to refine the learned representation with the help of unlabeled dialogs. We also implement a gating mechanism to weigh suitable unlabeled dialog samples. Empirical results show that GALAXY substantially improves the performance of task-oriented dialog systems, and achieves new state-of-the-art results on benchmark datasets: In-Car, MultiWOZ2.0 and MultiWOZ2.1, improving their end-to-end combined scores by 2.5, 5.3 and 5.5 points, respectively. We also show that GALAXY has a stronger few-shot ability than existing models under various low-resource settings. For reproducibility, we release the code and data at https://github.com/siat-nlp/GALAXY.

Authors

Luo Si

13 papers

Yinhe Zheng

6 papers

Yuchuan Wu

3 papers

TL;DR

Abstract

Authors

References80 items

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

Variational Latent-State GPT for Semi-Supervised Task-Oriented Dialog Systems

Soloist: Building Task Bots at Scale with Transfer Learning and Machine Teaching

Transferable Dialogue Systems and User Simulators

R-Drop: Regularized Dropout for Neural Networks

Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances

Dialogue-oriented Pre-training

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialogue State Tracking

SimCSE: Simple Contrastive Learning of Sentence Embeddings

Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems

Pretraining the Noisy Channel Model for Task-Oriented Dialogue

Domain State Tracking for a Simplified Dialogue System

Multi-goal multi-agent learning for task-oriented dialogue with bidirectional teacher-student learning

UBAR: Towards Fully End-to-End Task-Oriented Dialog Systems with GPT-2

Exploring Simple Siamese Representation Learning

LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization

Amalgamating Knowledge from Two Teachers for Task-oriented Dialogue System with Adversarial Training

Probing Task-Oriented Dialogue Representation from Language Models

STAR: A Schema-Guided Dialog Dataset for Transfer Learning

DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue

MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems

A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning

Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System

A Simple Language Model for Task-Oriented Dialogue

Recipes for Building an Open-Domain Chatbot

Multi-Domain Dialogue Acts and Response Co-Generation

PRAL: A Tailored Pre-Training Model for Task-Oriented Dialog Generation

Paraphrase Augmented Task-Oriented Dialog Generation

TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue

An Empirical Investigation of Pre-Trained Transformer Language Models for Open-Domain Dialogue Generation

Few-shot Natural Language Generation for Task-Oriented Dialog

Towards a Human-like Open-Domain Chatbot

Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context

ConveRT: Efficient and Accurate Conversational Representations from Transformers

DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation

PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable

Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations

Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset

Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset

Coached Conversational Preference Elicitation: A Case Study in Understanding Movie Preferences

Flexibly-Structured Model for Task-Oriented Dialogues

AmazonQA: A Review-Based Question Answering Task

ERNIE 2.0: A Continual Pre-training Framework for Language Understanding

Structured Fusion Networks for Dialog

Hello, It’s GPT-2 - How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems

Towards Universal Dialogue Act Tagging for Task-Oriented Dialogues

Few-Shot Dialogue Generation Without Annotated Data: A Transfer Learning Approach

An Introduction to Variational Autoencoders

Pretraining Methods for Dialog Context Representation Learning

Unified Language Model Pre-training for Natural Language Understanding and Generation

Interpolation Consistency Training for Semi-Supervised Learning

Talking to myself: self-dialogues as data for conversational agents

MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling

Explicit State Tracking with Semi-Supervisionfor Neural Dialogue Generation

Microsoft Dialogue Challenge: Building End-to-End Task-Completion Dialogue Systems

Sequicity: Simplifying Task-oriented Dialogue Systems with Single Sequence-to-Sequence Architectures

Bootstrapping a Neural Conversational Agent with Dialogue Self-Play, Crowdsourcing and On-Line Reinforcement Learning

Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

Personalizing Dialogue Agents: I have a dog, do you have pets too?

DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management

Key-Value Retrieval Networks for Task-Oriented Dialogue

Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning

Frames: a corpus for adding memory to goal-oriented dialogue systems

Neural Belief Tracker: Data-Driven Dialogue State Tracking

The Second Dialog State Tracking Challenge

Chameleons in Imagined Conversations: A New Approach to Understanding Coordination of Linguistic Style in Dialogs

Towards an ISO Standard for Dialogue Act Annotation

Bleu: a Method for Automatic Evaluation of Machine Translation

An Investigation of Suitability of Pre-Trained Language Models for Dialogue Generation – Avoiding Discrepancies

AuGPT: Dialogue with Pre-trained Language Models and Data Augmentation

SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing

Conversational Scaffolding: An Analogy-based Approach to Response Prioritization in Open-domain Dialogs

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Language Models are Unsupervised Multitask Learners

GENERATIVE ADVERSARIAL NETS