The task is to retrieve the adverb that describes how an action is performed in a video, for adverb-action compositions that were not seen during training.
This work proposes a framework for video-to-adverb retrieval that aligns video embeddings with their matching compositional adverb-action text embeddings in a joint embedding space. It outperforms all prior work on the generalisation task of retrieving adverbs from videos for unseen adverb-action compositions.
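The retrieval step described above can be illustrated with a minimal sketch. It is not the paper's implementation: the function names (`cosine`, `rank_adverbs`), the toy 3-dimensional embeddings, and the use of cosine similarity as the scoring function are all assumptions made for illustration. In practice the embeddings would come from trained video and text encoders.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def rank_adverbs(video_emb, action, adverbs, text_embs):
    """Rank candidate adverbs for a video whose action is known.

    text_embs is a hypothetical lookup from (adverb, action) to the
    embedding of that composition in the joint space.
    """
    scored = [(cosine(video_emb, text_embs[(adv, action)]), adv) for adv in adverbs]
    scored.sort(reverse=True)  # highest similarity first
    return [adv for _, adv in scored]

# Toy joint-space embeddings (made up for illustration only).
text_embs = {
    ("slowly", "cut"): [1.0, 0.0, 0.2],
    ("quickly", "cut"): [0.0, 1.0, 0.2],
}
video_emb = [0.1, 0.9, 0.3]  # stands in for a video encoder's output

ranking = rank_adverbs(video_emb, "cut", ["slowly", "quickly"], text_embs)
print(ranking)
```

Because each composition is scored by its embedding rather than by a class index, a composition that was absent from training can still be ranked, provided the text side can produce an embedding for it.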