What Makes A Good Story? Designing Composite Rewards for Visual Storytelling

Published in

AAAI Conference on Artificial Intelligence(2019)

External Links:

Generate Graph DownloadPDF

TL;DR

This paper proposes three assessment criteria: relevance, coherence and expressiveness, which are observed through empirical analysis could constitute a “high-quality” story to the human eye and proposes a reinforcement learning framework, ReCo-RL, with reward functions designed to capture the essence of these quality criteria.

Abstract

Previous storytelling approaches mostly focused on optimizing traditional metrics such as BLEU, ROUGE and CIDEr. In this paper, we re-examine this problem from a different angle, by looking deep into what defines a natural and topically-coherent story. To this end, we propose three assessment criteria: relevance, coherence and expressiveness, which we observe through empirical analysis could constitute a “high-quality” story to the human eye. We further propose a reinforcement learning framework, ReCo-RL, with reward functions designed to capture the essence of these quality criteria. Experiments on the Visual Storytelling Dataset (VIST) with both automatic and human evaluation demonstrate that our ReCo-RL model achieves better performance than state-of-the-art baselines on both traditional metrics and the proposed new criteria.

Authors

Junjie Hu

3 papers

Graham Neubig

38 papers

Zhe Gan

19 papers

References42 items

UNITER: UNiversal Image-TExt Representation Learning

UNITER: Learning UNiversal Image-TExt Representations

Domain Adaptive Text Style Transfer

Plan-And-Write: Towards Better Automatic Storytelling

Controllable Neural Story Plot Generation via Reward Shaping

What Makes A Good Story? Designing Composite Rewards for Visual Storytelling

Published in

AAAI Conference on Artificial Intelligence(2019)

External Links:

Generate Graph DownloadPDF

TL;DR

Abstract

Authors

Junjie Hu

3 papers

Graham Neubig

38 papers

Zhe Gan

19 papers

References42 items

UNITER: UNiversal Image-TExt Representation Learning

UNITER: Learning UNiversal Image-TExt Representations

Domain Adaptive Text Style Transfer

Plan-And-Write: Towards Better Automatic Storytelling

Controllable Neural Story Plot Generation via Reward Shaping

Jianfeng Gao

39 papers

Jingjing Liu

13 papers

Learning End-to-End Goal-Oriented Dialog with Multiple Answers

Learning Neural Templates for Text Generation

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting

Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation

Hierarchical Neural Story Generation

Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation

Towards Diverse Text Generation with Inverse Reinforcement Learning

No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling

Show, Reward and Tell: Automatic Generation of Narrative Paragraph From Photo Stream by Adversarial Training

Hierarchically-Attentive RNN for Album Summarization and Storytelling

Adversarial Feature Matching for Text Generation

A Deep Reinforced Model for Abstractive Summarization

Dense-Captioning Events in Videos

Towards Diverse and Natural Image Descriptions via a Conditional GAN

Self-Critical Sequence Training for Image Captioning

Semantic Compositional Networks for Visual Captioning

Professor Forcing: A New Algorithm for Training Recurrent Networks

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

SPICE: Semantic Propositional Image Caption Evaluation

Generative Adversarial Imitation Learning

Visual Storytelling

Image Captioning with Semantic Attention

Deep Residual Learning for Image Recognition

Expressing an Image Stream with a Sequence of Natural Sentences

Sequence Level Training with Recurrent Neural Networks

Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks

A Hierarchical Neural Autoencoder for Paragraphs and Documents

CIDEr: Consensus-based image description evaluation

From captions to visual concepts and back

Show and tell: A neural image caption generator

Sequence to Sequence Learning with Neural Networks

METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments

ROUGE: A Package for Automatic Evaluation of Summaries

Bleu: a Method for Automatic Evaluation of Machine Translation

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

Measuring nominal scale agreement among many raters.

Field of Study

Computer Science

Journal Information

Name

ArXiv

Volume

abs/2005.00687

Venue Information

Name

AAAI Conference on Artificial Intelligence

Type

conference

URL

http://www.aaai.org/

Alternate Names

National Conference on Artificial Intelligence
National Conf Artif Intell
AAAI Conf Artif Intell
AAAI