mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections (2022-05-24T00:00:00.000000Z)