Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7 (2018-06-01T00:00:00.000000Z)

TL;DR

The Audio Visual Scene Aware Dialog (AVSD) challenge and dataset is introduced, which is to build a system that generates responses in a dialog about an input video.

Abstract

Scene-aware dialog systems will be able to have conversations with users about the objects and events around them. Progress on such systems can be made by integrating state-of-the-art technologies from multiple research areas including end-to-end dialog systems visual dialog, and video description. We introduce the Audio Visual Scene Aware Dialog (AVSD) challenge and dataset. In this challenge, which is one track of the 7th Dialog System Technology Challenges (DSTC7) workshop1, the task is to build a system that generates responses in a dialog about an input video

Authors

Tim K. Marks

4 papers

Devi Parikh

32 papers

Dhruv Batra

43 papers

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7

TL;DR

Abstract

Authors

References9 items

End-to-end Conversation Modeling Track in DSTC6

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning

Attention-Based Multimodal Fusion for Video Description

Visual Dialog

GuessWhat?! Visual Object Discovery through Multi-modal Dialogue

Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding

The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems

VQA: Visual Question Answering

Translating Videos to Natural Language Using Deep Recurrent Neural Networks

Field of Study

Journal Information

Name

Volume

Venue Information

Name

Type

URL

Alternate Names