DAQUAR (DAtaset for QUestion Answering on Real-world images) is a dataset of human question-answer pairs about images.
Source: A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input
Image Source: https://www.mpi-inf.mpg.de/departments/computer-vision-and-multimodal-computing/research/vision-and-language/visual-turing-challenge/