Momentum via Primal Averaging: Theoretical Insights and Learning Rate Schedules for Non-Convex Optimization (2020-10-01T00:00:00.000000Z)