The WiLI benchmark dataset for written language identification (2018-01-23T00:00:00.000000Z)