Evaluation of Audio-Visual Alignments in Visually Grounded Speech Models (2021-07-05T00:00:00.000000Z)