Audio-visual zero-shot learning aims to recognize categories that were not seen during training, using paired audio-visual sequences as input.