Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering (2017-07-25)