While neuroevolution (evolving neural networks) has been successful across a variety of domains, from reinforcement learning to artificial life to evolutionary robotics, it is rarely applied to large, deep neural networks. A central reason is that while random mutation generally works in low dimensions, a random perturbation of thousands or millions of weights will likely break existing functionality. This paper proposes a solution: a family of safe mutation (SM) operators that facilitate exploration without dramatically altering network behavior or requiring additional interaction with the environment. The most effective SM variant scales the degree of mutation of each individual weight according to the sensitivity of the network's outputs to that weight, which requires computing the gradient of outputs with respect to the weights (instead of the gradient of error, as in conventional deep learning). This safe mutation through gradients (SM-G) operator dramatically increases the ability of a simple genetic algorithm-based neuroevolution method to find solutions in high-dimensional domains that require deep and/or recurrent neural networks, including domains that require processing raw pixels. By improving our ability to evolve deep neural networks, this new, safer approach to mutation expands the scope of domains amenable to neuroevolution.
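The SM-G idea lends itself to a short illustration. Below is a minimal, hypothetical sketch in PyTorch (not the authors' implementation): it estimates each weight's sensitivity from the gradient of the network's outputs with respect to the weights, evaluated over a batch of previously encountered inputs, and then divides a Gaussian perturbation element-wise by that sensitivity, so that weights the outputs depend on strongly are mutated less. The names mutate_smg, sigma, and eps are assumptions introduced for illustration.

```python
# Hypothetical sketch of SM-G (safe mutation through gradients),
# assuming `policy` is a torch.nn.Module and `states` is a tensor of
# previously seen inputs (so no new environment interaction is needed).
import torch

def mutate_smg(policy, states, sigma=0.1, eps=1e-8):
    """Perturb weights, scaling each weight's mutation inversely to the
    sensitivity of the network's outputs to that weight."""
    params = [p for p in policy.parameters() if p.requires_grad]
    outputs = policy(states)  # shape: (batch, num_outputs)

    # Estimate per-weight sensitivity by accumulating |d output_k / d w|
    # over all output dimensions (one backward pass per dimension).
    sensitivity = [torch.zeros_like(p) for p in params]
    for k in range(outputs.shape[1]):
        grads = torch.autograd.grad(outputs[:, k].sum(), params,
                                    retain_graph=True)
        for s, g in zip(sensitivity, grads):
            s += g.abs()

    # Apply a Gaussian perturbation, shrunk where outputs are sensitive,
    # in contrast to a naive mutation that perturbs all weights equally.
    with torch.no_grad():
        for p, s in zip(params, sensitivity):
            p.add_(sigma * torch.randn_like(p) / (s + eps))
```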