IEMOCAP Dataset - Papersgraph

Multimodal Emotion Recognition: IEMOCAP

The IEMOCAP dataset consists of 151 videos of recorded dialogues. Each session has 2 speakers, and counting each speaker's video separately gives 302 videos across the dataset. Each segment is annotated for the presence of 9 emotions (angry, excited, fear, sad, surprised, frustrated, happy, disappointed and neutral) as well as for valence, arousal and dominance. The dataset was recorded across 5 sessions, each with its own pair of speakers.
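As an illustration of the annotation scheme described above, here is a minimal Python sketch of how one annotated segment could be represented and turned into a multi-label target. The names SegmentAnnotation and multi_hot, the field layout, and the float defaults for valence, arousal and dominance are assumptions made for this example; they are not the official IEMOCAP file format or any library's API, and the description gives no numeric scale for the three dimensional ratings.

```python
from dataclasses import dataclass, field

# The nine emotion labels, in the order listed in the dataset description.
EMOTIONS = (
    "angry", "excited", "fear", "sad", "surprised",
    "frustrated", "happy", "disappointed", "neutral",
)

# Counts taken from the description: 151 dialogue videos, 2 speakers each.
DIALOGUE_VIDEOS = 151
SPEAKERS_PER_DIALOGUE = 2
NUM_SESSIONS = 5


@dataclass
class SegmentAnnotation:
    """Hypothetical record for one annotated segment (not the official format)."""
    session: int                                 # 1..NUM_SESSIONS
    emotions: set = field(default_factory=set)   # labels marked as present
    valence: float = 0.0                         # scale is an assumption
    arousal: float = 0.0
    dominance: float = 0.0

    def __post_init__(self):
        # Reject sessions outside the five described recording sessions.
        if not 1 <= self.session <= NUM_SESSIONS:
            raise ValueError(f"session must be in 1..{NUM_SESSIONS}")
        # Reject labels that are not among the nine listed emotions.
        unknown = set(self.emotions) - set(EMOTIONS)
        if unknown:
            raise ValueError(f"unknown emotion labels: {sorted(unknown)}")


def multi_hot(ann: SegmentAnnotation) -> list:
    """Encode presence of each emotion as 0/1, following the order of EMOTIONS."""
    return [1 if name in ann.emotions else 0 for name in EMOTIONS]


if __name__ == "__main__":
    ann = SegmentAnnotation(session=2, emotions={"happy", "excited"})
    print(multi_hot(ann))
    print(DIALOGUE_VIDEOS * SPEAKERS_PER_DIALOGUE)  # per-speaker video count
```

Because each segment is annotated for the presence of several emotions rather than a single class, a multi-hot vector is one natural target encoding; a single-label setup would need an extra rule for picking one emotion per segment.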

Source: Multi-attention Recurrent Network for Human Communication Comprehension

Image Source: https://sail.usc.edu/iemocap/Busso_2008_iemocap.pdf

IEMOCAP: interactive emotional dyadic motion capture database
