VinVL: Revisiting Visual Representations in Vision-Language Models - Citation Graph | Papersgraph