SNLI-VE

Introduced in Visual Entailment: A Novel Task for Fine-Grained Image Understanding

About this Dataset

Visual Entailment (VE) consists of image-sentence pairs in which the premise is an image rather than a natural language sentence, as it would be in traditional Textual Entailment tasks. The goal of a trained VE model is to predict whether the image semantically entails the text. SNLI-VE is a VE dataset built from the Stanford Natural Language Inference (SNLI) corpus and the Flickr30k dataset.

Source: https://github.com/necla-ml/SNLI-VE

Source: Visual Entailment: A Novel Task for Fine-Grained Image Understanding
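
To make the task described above concrete, here is a minimal sketch of how a single VE example and an accuracy check might be represented. It is illustrative only: the class and field names (`VEExample`, `image_id`, `hypothesis`) and the `accuracy` helper are hypothetical and are not taken from the SNLI-VE repository, and the three-way label set is assumed to follow the SNLI convention (entailment, neutral, contradiction).

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Iterable


class Label(Enum):
    # Assumption: three-way labels following the SNLI convention.
    ENTAILMENT = "entailment"
    NEUTRAL = "neutral"
    CONTRADICTION = "contradiction"


@dataclass(frozen=True)
class VEExample:
    # Hypothetical field names, chosen for illustration only.
    image_id: str    # identifies the premise image (e.g. a Flickr30k image)
    hypothesis: str  # the sentence whose entailment is being judged
    label: Label     # gold relation between image and sentence


def accuracy(
    examples: Iterable[VEExample],
    predict: Callable[[str, str], Label],
) -> float:
    """Fraction of examples for which predict(image_id, hypothesis) matches the gold label."""
    items = list(examples)
    if not items:
        raise ValueError("accuracy is undefined for an empty example list")
    correct = sum(predict(ex.image_id, ex.hypothesis) is ex.label for ex in items)
    return correct / len(items)
```

A real model would load the image behind `image_id` and score the sentence against it; here `predict` is left as a plain callable so that any classifier can be plugged in.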

Dataset Variants

SNLI-VE val, SNLI-VE test, SNLI-VE

Papers (1)

Visual Entailment: A Novel Task for Fine-Grained Image Understanding

A new inference task, Visual Entailment (VE), is introduced. It consists of image-sentence pairs in which the premise is defined by an image rather than by a natural language sentence, as in traditional Textual Entailment tasks.

Dataset Loaders

allenai/allennlp-models (PyTorch)

necla-ml/SNLI-VE

Tasks

Visual Question Answering (VQA), Natural Language Inference, Visual Reasoning, Visual Entailment

Similar Datasets