3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in grounded-open-vocabulary-acquisition-3
No benchmarks available.
Use these libraries to find grounded-open-vocabulary-acquisition-3 models and implementations
No datasets available.
No subtasks available.
This work proposes World-to-Words (W2W), a novel visually-grounded language model by pre-training on image-text pairs highlighting grounding as an objective, and demonstrates that W2W is a more coherent and fast grounded word learner, and that the grounding ability acquired during pre- training helps the model to learn unseen words more rapidly and robustly.
Adding a benchmark result helps the community track progress.