A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer (2021-12-09)