Multimodal machine translation is the task of translating text using more than one data source, typically the source sentence plus an accompanying image. For example, a system might translate the sentence "a bird is flying over water" into German, using an image of a bird over water as additional context. (Image credit: Findings of the Third Shared Task on Multimodal Machine Translation)