Temporal sentence grounding (TSG) aims to locate a specific moment in an untrimmed video given a natural language query. Three levels of supervision are used for this task. 1) Weak supervision: a video-level action category set; 2) Semi-weak supervision: a video-level action category set plus action annotations at several timestamps; 3) Full supervision: action category and action interval annotations for all actions in the untrimmed video.
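The three supervision levels differ only in how much temporal detail each annotation carries, which a small data-structure sketch can make concrete. Everything below is illustrative: the class names, field layouts, and the helper functions `to_weak` and `to_semi_weak` are hypothetical and do not come from any specific dataset or library, and placing one timestamp at each interval's midpoint is an assumption chosen for simplicity.

```python
from dataclasses import dataclass
from typing import List, Set, Tuple


@dataclass
class WeakAnnotation:
    """Weak supervision: only the set of action categories in the video."""
    categories: Set[str]


@dataclass
class SemiWeakAnnotation:
    """Semi-weak supervision: category set plus a few labeled timestamps."""
    categories: Set[str]
    timestamps: List[Tuple[float, str]]  # (time in seconds, category)


@dataclass
class FullAnnotation:
    """Full supervision: category and interval for every action."""
    intervals: List[Tuple[float, float, str]]  # (start s, end s, category)


def to_weak(full: FullAnnotation) -> WeakAnnotation:
    # Drop all temporal information and keep only the category set.
    return WeakAnnotation(categories={cat for _, _, cat in full.intervals})


def to_semi_weak(full: FullAnnotation) -> SemiWeakAnnotation:
    # Assumption: keep one timestamp per action, at the interval midpoint.
    stamps = [((start + end) / 2.0, cat) for start, end, cat in full.intervals]
    return SemiWeakAnnotation(categories=to_weak(full).categories,
                              timestamps=stamps)


if __name__ == "__main__":
    full = FullAnnotation(intervals=[(2.0, 6.0, "open door"),
                                     (10.0, 14.0, "pour water")])
    print(to_weak(full))
    print(to_semi_weak(full))
```

Deriving the weaker forms from a full annotation shows that each level is a strict reduction of the one above it: full annotations contain intervals, semi-weak annotations keep only points in time, and weak annotations keep no timing at all.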