These leaderboards are used to track progress in few-shot-video-question-answering-23
A parameter-efficient method is introduced that combines multimodal prompt learning with a transformer-based mapping network while keeping the pretrained models frozen. It targets challenges such as overfitting, catastrophic forgetting, and the cross-modal gap between vision and language.
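The general idea of training only a small mapper and a few prompt tokens between frozen models can be sketched as follows. This is a minimal illustration assuming PyTorch, not the paper's implementation: `MappingNetwork`, `freeze`, all dimensions, and the tiny stand-in "pretrained" modules are hypothetical names and values chosen for the example.

```python
import torch
import torch.nn as nn


def freeze(module: nn.Module) -> nn.Module:
    """Disable gradients for every parameter and switch to eval mode."""
    for p in module.parameters():
        p.requires_grad_(False)
    return module.eval()


class MappingNetwork(nn.Module):
    """Hypothetical transformer-based mapper.

    Projects frozen visual features into the language model's embedding
    space, prepends learnable prompt tokens, and returns the transformed
    prompt positions as a visual prefix.
    """

    def __init__(self, vis_dim, lm_dim, num_prompts=4, num_layers=2, num_heads=4):
        super().__init__()
        self.num_prompts = num_prompts
        self.proj = nn.Linear(vis_dim, lm_dim)
        # Learnable prompt tokens (assumed design, small random init).
        self.prompts = nn.Parameter(torch.randn(num_prompts, lm_dim) * 0.02)
        layer = nn.TransformerEncoderLayer(
            d_model=lm_dim, nhead=num_heads,
            dim_feedforward=2 * lm_dim, batch_first=True,
        )
        self.encoder = nn.TransformerEncoder(layer, num_layers=num_layers)

    def forward(self, vis_feats):                      # (B, T, vis_dim)
        x = self.proj(vis_feats)                       # (B, T, lm_dim)
        p = self.prompts.unsqueeze(0).expand(x.size(0), -1, -1)
        out = self.encoder(torch.cat([p, x], dim=1))   # (B, P + T, lm_dim)
        return out[:, : self.num_prompts]              # (B, P, lm_dim)


# Stand-ins for the pretrained models (assumptions, not real checkpoints).
vision_encoder = freeze(nn.Linear(32, 64))             # per-frame features
text_embed = freeze(nn.Embedding(1000, 128))           # LM token embeddings
language_model = freeze(nn.TransformerEncoder(
    nn.TransformerEncoderLayer(128, 4, 256, batch_first=True), num_layers=2))

mapper = MappingNetwork(vis_dim=64, lm_dim=128)        # the only trainable part

frames = torch.randn(2, 8, 32)                         # 2 videos, 8 frames each
question = torch.randint(0, 1000, (2, 5))              # 5 question tokens each

with torch.no_grad():                                  # frozen vision side
    vis_feats = vision_encoder(frames)
prefix = mapper(vis_feats)                             # visual prefix tokens
lm_inputs = torch.cat([prefix, text_embed(question)], dim=1)
hidden = language_model(lm_inputs)

trainable = sum(p.numel() for p in mapper.parameters() if p.requires_grad)
frozen = sum(p.numel() for m in (vision_encoder, text_embed, language_model)
             for p in m.parameters())
```

In this sketch only `mapper` receives gradient updates; the frozen modules keep their weights, which is the property the summary above relies on to limit overfitting and forgetting in the few-shot setting.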