Learning to solve complex sequences of tasks, while both leveraging transfer and avoiding catastrophic forgetting, remains a key obstacle to achieving human-level intelligence. Progressive networks represent a step forward in this direction: they are immune to forgetting and can leverage prior knowledge via lateral connections to previously learned features. We evaluate this architecture extensively on a wide variety of reinforcement learning tasks (Atari and 3D maze games), and show that it outperforms common baselines based on pretraining and finetuning. Using a novel sensitivity measure, we further demonstrate that transfer occurs at both low-level sensory and high-level control layers of the learned policy.
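The lateral-connection mechanism is compact enough to sketch. In the paper, layer i of a new column k computes h_i^{(k)} = f(W_i^{(k)} h_{i-1}^{(k)} + \sum_{j<k} U_i^{(k:j)} h_{i-1}^{(j)}), with all earlier columns frozen. Below is a minimal, hypothetical PyTorch sketch of that idea using plain MLP columns with identical layer sizes; the class name `ProgressiveColumn` and the helper `_run` are illustrative assumptions, not the authors' released code (the paper's columns are convolutional policy networks trained with reinforcement learning).

```python
import torch
import torch.nn as nn


class ProgressiveColumn(nn.Module):
    """One column of a progressive network: a small MLP whose layers also
    receive lateral inputs from the same depth in earlier, frozen columns.
    (Hypothetical sketch; assumes all columns share the same layer sizes.)"""

    def __init__(self, sizes, prev_columns=()):
        super().__init__()
        self.prev_columns = list(prev_columns)
        # Freeze earlier columns; leaving their weights untouched is what
        # makes the architecture immune to catastrophic forgetting.
        for col in self.prev_columns:
            for p in col.parameters():
                p.requires_grad = False
        # Within-column weights W_i.
        self.layers = nn.ModuleList(
            nn.Linear(sizes[i], sizes[i + 1]) for i in range(len(sizes) - 1)
        )
        # Lateral adapters U_i^{(k:j)}: one per (layer, earlier column) pair.
        self.laterals = nn.ModuleList(
            nn.ModuleList(
                nn.Linear(sizes[i], sizes[i + 1], bias=False)
                for _ in self.prev_columns
            )
            for i in range(len(sizes) - 1)
        )

    def forward(self, x):
        out, _ = self._run(x)
        return out

    def _run(self, x):
        # Per-layer activations h_{i-1}^{(j)} of every earlier column,
        # computed without gradients since those columns are frozen.
        with torch.no_grad():
            prev_acts = [col._run(x)[1] for col in self.prev_columns]
        acts, h = [], x
        for i, layer in enumerate(self.layers):
            acts.append(h)  # input to layer i, i.e. h_{i-1} of this column
            z = layer(h)
            for j, lateral in enumerate(self.laterals[i]):
                z = z + lateral(prev_acts[j][i])  # lateral from column j
            h = torch.relu(z)
        return h, acts


# Train column 1 on task A, then grow a new column for task B that reuses
# column 1's features through its lateral adapters.
col1 = ProgressiveColumn([8, 32, 4])
# ... train col1 on the first task ...
col2 = ProgressiveColumn([8, 32, 4], prev_columns=[col1])
out = col2(torch.randn(5, 8))  # only col2's own and lateral weights train
```

Because the earlier column is frozen and kept out of the new column's registered parameters, an optimizer built from `col2.parameters()` updates only the new weights, so performance on the first task cannot degrade.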
Andrei A. Rusu, R. Hadsell, Hubert Soyer, Guillaume Desjardins, Neil C. Rabinowitz