The Ad-hoc Video Search task completed a three-year cycle (2016-2018) whose goal was to model the end-user search use case: a user searching, with textual sentence queries, for segments of video containing persons, objects, activities, locations, etc., and combinations thereof. While the Internet Archive (IACC.3) dataset was used from 2016 to 2018, starting in 2019 a new data collection based on Vimeo Creative Commons (V3C) will be adopted to support the task for at least three more years. Given the test collection (V3C1 or IACC.3), the master shot boundary reference, and a set of Ad-hoc queries (approx. 30) released by NIST, systems return, for each query, a list of at most 1000 shot IDs from the test collection ranked by their likelihood of containing the target of the query.
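The ranked-retrieval protocol above can be sketched as follows. This is a minimal illustration, not any official TRECVID evaluation code: `rank_shots` and the random embeddings are hypothetical stand-ins for a real system's query and shot representations.

```python
import numpy as np

def rank_shots(query_vec, shot_vecs, shot_ids, k=1000):
    """Rank shots by cosine similarity to a query vector; return the top-k shot IDs."""
    q = query_vec / np.linalg.norm(query_vec)
    s = shot_vecs / np.linalg.norm(shot_vecs, axis=1, keepdims=True)
    scores = s @ q                      # cosine similarity per shot
    order = np.argsort(-scores)[:k]     # best-scoring shots first
    return [shot_ids[i] for i in order]

# Toy example: 5 shots with 4-dimensional embeddings (random, for illustration only).
rng = np.random.default_rng(0)
shot_vecs = rng.normal(size=(5, 4))
shot_ids = ["shot_%d" % i for i in range(5)]
top = rank_shots(rng.normal(size=4), shot_vecs, shot_ids, k=3)
print(top)  # three shot IDs, best match first
```

In a real submission the list would be capped at 1000 IDs per query, exactly as the task definition requires.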
This paper takes a concept-free approach, proposing a dual deep encoding network that encodes videos and queries into powerful dense representations of their own and establishes a new state-of-the-art for zero-example video retrieval.
With W2VV++, a super version of Word2VisualVec previously developed for visual-to-text matching, a new baseline for ad-hoc video search is established, which outperforms the state-of-the-art.
This paper attacks the challenging problem of video retrieval by text. In such a retrieval paradigm, an end user searches for unlabeled videos by ad-hoc queries described exclusively in the form of a natural-language sentence, with no visual example provided. Given videos as sequences of frames and queries as sequences of words, an effective sequence-to-sequence cross-modal matching is crucial. To that end, the two modalities need to be first encoded into real-valued vectors and then projected into a common space. In this paper we achieve this by proposing a dual deep encoding network that encodes videos and queries into powerful dense representations of their own. Our novelty is two-fold. First, different from prior art that resorts to a specific single-level encoder, the proposed network performs multi-level encoding that represents the rich content of both modalities in a coarse-to-fine fashion. Second, different from a conventional common space learning algorithm which is either concept based or latent space based, we introduce hybrid space learning which combines the high performance of the latent space and the good interpretability of the concept space. Dual encoding is conceptually simple, practically effective and end-to-end trained with hybrid space learning. Extensive experiments on four challenging video datasets show the viability of the new method. Code and data are available at https://github.com/danieljf24/hybrid_space.
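The common-space matching idea described in the abstract can be sketched in a few lines. This is a deliberately simplified stand-in, assuming mean pooling in place of the paper's multi-level encoders and random (untrained) projection matrices; it only illustrates how two modalities are mapped into one space and compared.

```python
import numpy as np

def encode_video(frame_feats, W_v):
    """Toy video encoder: mean-pool per-frame features, project into the common space."""
    return np.mean(frame_feats, axis=0) @ W_v

def encode_query(word_vecs, W_t):
    """Toy query encoder: mean-pool word embeddings, project into the common space."""
    return np.mean(word_vecs, axis=0) @ W_t

def similarity(v, t):
    """Cosine similarity between a video and a query in the common space."""
    return float(v @ t / (np.linalg.norm(v) * np.linalg.norm(t)))

rng = np.random.default_rng(1)
W_v = rng.normal(size=(512, 128))    # video-side projection (learned during training in practice)
W_t = rng.normal(size=(300, 128))    # text-side projection (learned during training in practice)
frames = rng.normal(size=(20, 512))  # 20 frames x 512-d visual features
words = rng.normal(size=(7, 300))    # 7 words x 300-d word embeddings
s = similarity(encode_video(frames, W_v), encode_query(words, W_t))
print(s)  # a value in [-1, 1]
```

The actual dual encoding network replaces the mean pooling here with multi-level (coarse-to-fine) encoders and learns the projections end-to-end with hybrid space learning.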
As extensive experiments on four benchmarks show, SEA surpasses the state-of-the-art and is extremely easy to implement, making SEA an appealing solution for AVS and promising for continuously advancing the task by harvesting new sentence encoders.
LAFF is proposed, a new baseline for text-to-video retrieval that performs feature fusion at both early and late stages and at both video and text ends, making it a powerful method for exploiting diverse (off-the-shelf) features.
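The fusion idea behind LAFF can be illustrated with a simple attention-weighted combination of feature vectors. This is a hedged sketch, not LAFF's actual implementation: the softmax weighting and the random inputs are stand-ins, assuming the diverse features have already been projected to a shared dimensionality.

```python
import numpy as np

def attention_fusion(feats, logits):
    """Fuse multiple feature vectors with softmax attention weights
    (a simplified stand-in for lightweight attentional feature fusion)."""
    a = np.exp(logits - np.max(logits))  # numerically stable softmax
    a = a / a.sum()
    return sum(ai * f for ai, f in zip(a, feats))

rng = np.random.default_rng(2)
feats = [rng.normal(size=128) for _ in range(3)]  # three off-the-shelf features, same dim
logits = rng.normal(size=3)                       # attention logits (learned in practice)
fused = attention_fusion(feats, logits)
print(fused.shape)  # (128,)
```

In LAFF this kind of fusion is applied at both early and late stages and on both the video and text sides.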
A new encoder-decoder network that learns an interpretable cross-modal representation is proposed for ad-hoc video search, outperforming several state-of-the-art retrieval models by a statistically significant margin.