Adapting multi-armed bandits policies to contextual bandits scenarios (2018-11-11T00:00:00.000000Z)