A clustering framework for lexical normalization of Roman Urdu (2020-03-31)