Multimodal Grounding for Sequence-to-sequence Speech Recognition (2018-11-09T00:00:00.000000Z)