SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing (2021-10-14T00:00:00.000000Z)