Lottery tickets in deep learning [2] refer to highly sparse neural network initializations that train to the performance level of their dense counterparts. The existence of such sparse trainable initializations has previously been documented for a variety of gradient-based training settings. But is the lottery ticket phenomenon an idiosyncrasy of stochastic gradient descent, or does it generalize to evolutionary optimization? In this paper we establish the existence of highly sparse trainable initializations for evolution strategies (ES) and characterize qualitative differences compared to gradient descent (GD)-based sparse training. We introduce a novel signal-to-noise ratio (SNR) iterative pruning procedure, which extracts evolvable sub-networks and incorporates loss curvature information into the network pruning step. We demonstrate the existence of highly sparse evolvable initializations for a wide range of network architectures, evolution strategies and task settings. Furthermore, we find that these initializations encode an inductive bias, which transfers across different evolution strategies, related tasks and even GD-based training. Finally, we compare the local optima resulting from the different optimization paradigms and sparsity levels. In contrast to GD, ES explore diverse and flat local optima and do not preserve linear mode connectivity across sparsity levels and independent runs. The full paper was accepted at the ICML conference [4].
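To make the pruning criterion concrete, here is a minimal sketch of SNR-based iterative pruning, assuming an ES that maintains a diagonal Gaussian search distribution over weights (as in many natural evolution strategies). The function name and interface are hypothetical, and the loss-curvature term used in the paper is omitted for brevity: the sketch only scores each weight by the signal-to-noise ratio |mean| / std of the search distribution and keeps the highest-scoring fraction.

```python
import numpy as np

def snr_prune_mask(mean, std, sparsity, prev_mask=None):
    """Return a boolean keep-mask over weights (hypothetical helper).

    mean, std : per-weight mean and std of the ES search distribution.
    sparsity  : fraction of weights to prune away (0.0 keeps everything).
    prev_mask : optional mask from the previous pruning iteration;
                already-pruned weights stay pruned.
    """
    # Signal-to-noise ratio: weights the ES is both confident about
    # (small std) and uses strongly (large |mean|) score highest.
    snr = np.abs(mean) / (std + 1e-8)
    if prev_mask is not None:
        snr = np.where(prev_mask, snr, -np.inf)
    k = int(round((1.0 - sparsity) * mean.size))
    # Keep the k weights with the largest SNR.
    keep_idx = np.argsort(snr.ravel())[-k:]
    mask = np.zeros(mean.size, dtype=bool)
    mask[keep_idx] = True
    return mask.reshape(mean.shape)

# Usage: iteratively re-run ES on the masked network, then prune again
# at a higher sparsity level, rewinding surviving weights to their
# initialization to obtain the sparse trainable "ticket".
mean = np.array([1.0, 0.1, 2.0, 0.05])
std = np.full(4, 0.1)
mask = snr_prune_mask(mean, std, sparsity=0.5)
```

In this toy example the SNR values are [10, 1, 20, 0.5], so at 50% sparsity the mask keeps the first and third weights.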