3260 papers • 126 benchmarks • 313 datasets
Human Interaction Recognition (HIR) is a field of study that involves the development of computer algorithms to detect and recognize human interactions in videos, images, or other multimedia content. The goal of HIR is to automatically identify and analyze the social interactions between people, their body language, and facial expressions.
(Image credit: Papersgraph)
These leaderboards are used to track progress in human-interaction-recognition-6
Use these libraries to find human-interaction-recognition-6 models and implementations
This work proposes a simpler yet very powerful architecture, named Interaction Relational Network, which utilizes minimal prior knowledge about the structure of the human body to identify by itself how to relate the body parts from the individuals interacting and is capable of paramount sequential relational reasoning.
This work learns Slow-Fast auditory streams with separable convolutions and multi-level lateral connections in a two-stream convolutional network for audio recognition, that operates on time-frequency spectrogram inputs.
A novel unified two-person graph to represent inter-body and intra-body correlations between joints and several graph labeling strategies are designed to supervise the model to learn discriminant spatial-temporal interactive features.
Entity Rearrangement is proposed to eliminate the orderliness in ISTs for interchangeable entities, which is a unified way to represent motions of multiple diverse entities by extending the entity dimension, which provides better interactive representations.
Adding a benchmark result helps the community track progress.