3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in short-term-object-interaction-anticipation-9
Use these libraries to find short-term-object-interaction-anticipation-9 models and implementations
No subtasks available.
In these five tasks, the performance of InternVideo-Ego4D comprehensively surpasses the baseline methods and the champions of CVPR2022, demonstrating the powerful representation ability of Intern video as a video foundation model.
Baseline results show that the ENIGMA-51 dataset poses a challenging benchmark to study human behavior in industrial scenarios.
This paper studied the short-term object interaction anticipation problem from the egocentric point of view, proposing a new end-to-end architecture named StillFast, which is ranked first in the public leaderboard of the EGO4D short term object interaction expectation challenge 2022 and it is the official baseline for the 2023 one.
The experimental results performed on the largest egocentric dataset demonstrate that GANO outperforms the existing state-of-the-art methods for the prediction of the next active object label, its bounding box location, the corresponding future action, and the time to contact the object.
This paper proposes InterDiff, a framework comprising two key steps: interaction diffusion, where a diffusion model is leverage to encode the distribution of future human-object interactions; and interaction correction, where a physics-informed predictor is introduced to correct denoised HOIs in a diffusion step.
Adding a benchmark result helps the community track progress.