MultiWOZ 2.3: A Multi-domain Task-Oriented Dialogue Dataset Enhanced with Annotation Corrections and Co-Reference Annotation (2020-10-12T00:00:00.000000Z)

TL;DR

This paper introduces MultiWOZ 2.3, in which it differentiate incorrect annotations in dialogue acts from dialogue states, and identifies a lack of co-reference when publishing the updated dataset, to ensure consistency between dialogue acts and dialogue states.

Abstract

Task-oriented dialogue systems have made unprecedented progress with multiple state-of-the-art (SOTA) models underpinned by a number of publicly available MultiWOZ datasets. Dialogue state annotations are error-prone, leading to sub-optimal performance. Various efforts have been put in rectifying the annotation errors presented in the original MultiWOZ dataset. In this paper, we introduce MultiWOZ 2.3, in which we differentiate incorrect annotations in dialogue acts from dialogue states, identifying a lack of co-reference when publishing the updated dataset. To ensure consistency between dialogue acts and dialogue states, we implement co-reference features and unify annotations of dialogue acts and dialogue states. We update the state of the art performance of natural language understanding and dialogue state tracking on MultiWOZ 2.3, where the results show significant improvements than on previous versions of MultiWOZ datasets (2.0-2.2).

Authors

Minlie Huang

23 papers

Wei Peng

2 papers

Ting Han

1 papers

TL;DR

Abstract

Authors

References32 items

Slot Attention with Value Normalization for Multi-domain Dialogue State Tracking

DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue

MultiWOZ 2.2 : A Dialogue Dataset with Additional Annotation Corrections and State Tracking Baselines

TripPy: A Triple Copy Strategy for Value Independent Neural Dialog State Tracking

CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

Coreference Resolution: Toward End-to-End and Cross-Lingual Systems

Efficient Dialogue State Tracking by Selectively Overwriting Memory

Multi-domain Dialogue State Tracking as Dynamic Knowledge Graph Enhanced Question Answering

Improving Open-Domain Dialogue Systems via Multi-Turn Incomplete Utterance Restoration

Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking

GECOR: An End-to-End Generative Ellipsis and Co-reference Resolution Model for Task-Oriented Dialogue

Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset

Scalable and Accurate Dialogue State Tracking via Hierarchical Sequence Generation

Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog

Dialog State Tracking: A Neural Reading Comprehension Approach

SUMBT: Slot-Utterance Matching for Universal and Scalable Belief Tracking

Improving Multi-turn Dialogue Modelling with Utterance ReWriter

Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention

Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems

ConvLab: Multi-Domain End-to-End Dialog System Platform

BERT for Joint Intent Classification and Slot Filling

Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models

User Modeling for Task Oriented Dialogues

MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling

A Network-based End-to-End Trainable Task-oriented Dialogue System

The Second Dialog State Tracking Challenge

The Dialog State Tracking Challenge

Agenda-Based User Simulation for Bootstrapping a POMDP Dialogue System

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

MultiWOZ 2.

of the Association for Computational Linguistics

Field of Study

Venue Information

Name

Type

URL

Alternate Names