Introduced in HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips (2019)
HowTo100M is a large-scale dataset of narrated videos with an emphasis on instructional videos where content creators teach complex tasks with an explicit intention of explaining the visual content on screen. HowTo100M features a total of:
Each video is associated with a narration, available as subtitles automatically downloaded from YouTube.
Source: HowTo100M