HC-STVG1

Human-centric Spatio-Temporal Video Grounding

Introduced in Human-centric Spatio-Temporal Video Grounding With Visual Transformers (2020)

About this Dataset

The newly proposed HC-STVG task aims to localize the target person spatio-temporally in an untrimmed video. For this task, we collect a new benchmark dataset with spatio-temporal annotations of the target persons in complex multi-person scenes, together with full interaction and rich action information.

Source: Human-centric Spatio-Temporal Video Grounding With Visual Transformers

Dataset Variants

HC-STVG1

Papers (1)

Human-Centric Spatio-Temporal Video Grounding With Visual Transformers

This work introduces a novel task, Human-centric Spatio-Temporal Video Grounding (HC-STVG), which aims to localize a spatio-temporal tube of the target person from an untrimmed video based on a given textual description.
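To make the notion of a spatio-temporal tube concrete, the sketch below models it as one bounding box per frame, paired with the text query it answers. This is only an illustration: the class names `SpatioTemporalTube` and `GroundingSample`, the box layout, and the example sentence and coordinates are all assumptions made here, not the dataset's actual annotation format.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixel coordinates.
Box = Tuple[float, float, float, float]


@dataclass
class SpatioTemporalTube:
    """Hypothetical container for one person's tube: a box per frame."""
    boxes: Dict[int, Box] = field(default_factory=dict)  # frame index -> box

    def span(self) -> Optional[Tuple[int, int]]:
        """Return (first_frame, last_frame), or None for an empty tube."""
        if not self.boxes:
            return None
        return min(self.boxes), max(self.boxes)

    def box_at(self, frame: int) -> Optional[Box]:
        """Return the box at `frame`, or None if the person is not localized there."""
        return self.boxes.get(frame)


@dataclass
class GroundingSample:
    """Hypothetical pairing of a text query with its target tube."""
    description: str
    tube: SpatioTemporalTube


# Invented example values, for illustration only.
sample = GroundingSample(
    description="The man in the grey coat stands up and walks to the door.",
    tube=SpatioTemporalTube({
        40: (120.0, 60.0, 220.0, 330.0),
        41: (124.0, 60.0, 224.0, 331.0),
        42: (129.0, 61.0, 229.0, 332.0),
    }),
)
```

Under these assumptions, `sample.tube.span()` gives the temporal extent of the grounding and `sample.tube.box_at(frame)` gives the spatial location in a single frame, so the two halves of the task (when and where) map onto the two accessors.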

Tasks

Spatio-Temporal Video Grounding

Similar Datasets

MNIST

CelebA

GLUE

Statistics

Papers: 1

Tasks: 32

Introduced: 2020