Optical Flow Estimation Using a Spatial Pyramid Network (2016-11-03T00:00:00.000000Z)

TL;DR

The Spatial Pyramid Network (SPyNet) is much simpler and 96% smaller than FlowNet in terms of model parameters, which makes it more efficient and appropriate for embedded applications.

Abstract

We learn to compute optical flow by combining a classical spatial-pyramid formulation with deep learning. This estimates large motions in a coarse-to-fine approach by warping one image of a pair at each pyramid level by the current flow estimate and computing an update to the flow. Instead of the standard minimization of an objective function at each pyramid level, we train one deep network per level to compute the flow update. Unlike the recent FlowNet approach, the networks do not need to deal with large motions, these are dealt with by the pyramid. This has several advantages. First, our Spatial Pyramid Network (SPyNet) is much simpler and 96% smaller than FlowNet in terms of model parameters. This makes it more efficient and appropriate for embedded applications. Second, since the flow at each pyramid level is small (

Authors

Michael J. Black

39 papers

Anurag Ranjan

5 papers

TL;DR

Abstract

Authors

References55 items

Deep Discrete Flow

Scalable Robust Principal Component Analysis using Grassmann Averages.

Back to Basics: Unsupervised Learning of Optical Flow via Brightness Constancy and Motion Smoothness

Fast Optical Flow Using Dense Inverse Search

Unsupervised convolutional neural networks for motion estimation

Understanding deep convolutional networks

Decoding MT motion response for optical flow estimation: An experimental evaluation

Deep Residual Learning for Image Recognition

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

Deep End2End Voxel2Voxel Prediction

Deep multi-scale video prediction beyond mean square error

What can we expect from a V1-MT feedforward architecture for optical flow estimation?

Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks

Efficient sparse-to-dense optical flow estimation using a learned basis and layers

FlowNet: Learning Optical Flow with Convolutional Networks

EpicFlow: Edge-preserving interpolation of correspondences for optical flow

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

Adam: A Method for Stochastic Optimization

Fully convolutional networks for semantic segmentation

Fast Edge-Preserving PatchMatch for Large Displacement Optical Flow

Going deeper with convolutions

Optical Flow Estimation with Channel Constancy

ImageNet Large Scale Visual Recognition Challenge

A Quantitative Analysis of Current Practices in Optical Flow Estimation and the Principles Behind Them

A Naturalistic Open Source Movie for Optical Flow Evaluation

Are we ready for autonomous driving? The KITTI vision benchmark suite

Video Primal Sketch: A generic middle-level representation of video

Dense Point Trajectories by GPU-Accelerated Large Displacement Optical Flow

Convolutional Learning of Spatio-temporal Features

Secrets of optical flow estimation and their principles

Learning to Represent Spatial Transformations with Factored Higher-Order Boltzmann Machines

Large displacement optical flow

Fields of Experts

Learning Transformational Invariants from Natural Movies

Et al

Learning Optical Flow

A Database and Evaluation Methodology for Optical Flow

Optimal Filters for Extended Optical Flow

High Accuracy Optical Flow Estimation Based on a Theory for Warping

Learning sparse, overcomplete representations of time-varying natural images

Independent component analysis of natural image sequences yields spatio-temporal filters similar to simple cells in primary visual cortex

A model of neuronal responses in visual area MT

A framework for the robust estimation of optical flow

Performance of optical flow techniques

Model for the extraction of image flow.

Hierarchical Motion Detection

Robot vision

Spatiotemporal energy models for the perception of motion.

PYRAMID METHODS IN IMAGE PROCESSING.

The Laplacian Pyramid as a Compact Image Code

Determining Optical Flow

Author manuscript, published in "IEEE Intenational Conference on Computer Vision (ICCV), Sydney: Australie (2013)" DeepFlow: Large displacement optical flow with deep matching

Anisotropic Huber-L1 Optical Flow

J. Opt. Soc. Am. A

c ○ 2000 Kluwer Academic Publishers. Manufactured in The Netherlands. Learning Low-Level Vision

Field of Study

Journal Information

Name

Page

Venue Information

Name

Type

URL

Alternate Names