3260 papers • 126 benchmarks • 313 datasets
Motion Estimation is used to determine the block-wise or pixel-wise motion vectors between two frames. Source: MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement
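The block-wise case described above is classically solved by block matching: each block of the current frame is compared against shifted candidate blocks in the previous frame, and the shift with the lowest photometric error becomes that block's motion vector. A minimal NumPy sketch of exhaustive-search block matching (function name, block size, and search range are illustrative, not from any particular paper):

```python
import numpy as np

def block_matching(prev, curr, block=8, search=4):
    """Estimate block-wise motion vectors by exhaustive search.

    For each `block` x `block` patch of `curr`, find the displacement
    (dy, dx) within +/- `search` pixels that minimizes the sum of
    absolute differences (SAD) against `prev`.
    Returns an array of shape (H // block, W // block, 2).
    """
    h, w = curr.shape
    vecs = np.zeros((h // block, w // block, 2), dtype=int)
    for by in range(0, h - block + 1, block):
        for bx in range(0, w - block + 1, block):
            target = curr[by:by + block, bx:bx + block].astype(float)
            best, best_v = np.inf, (0, 0)
            for dy in range(-search, search + 1):
                for dx in range(-search, search + 1):
                    y, x = by + dy, bx + dx
                    # Skip candidates that fall outside the frame.
                    if y < 0 or x < 0 or y + block > h or x + block > w:
                        continue
                    cand = prev[y:y + block, x:x + block].astype(float)
                    sad = np.abs(target - cand).sum()
                    if sad < best:
                        best, best_v = sad, (dy, dx)
            vecs[by // block, bx // block] = best_v
    return vecs
```

Real codecs and MEMC pipelines replace the exhaustive search with hierarchical or diamond search patterns, but the objective is the same.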
(Image credit: Papersgraph)
These leaderboards are used to track progress in Motion Estimation.

No benchmarks available.
Use these libraries to find Motion Estimation models and implementations.
No subtasks available.
Per-pixel ground-truth depth data is challenging to acquire at scale. To overcome this limitation, self-supervised learning has emerged as a promising alternative for training models to perform monocular depth estimation. In this paper, we propose a set of improvements, which together result in both quantitatively and qualitatively improved depth maps compared to competing self-supervised methods. Research on self-supervised monocular training usually explores increasingly complex architectures, loss functions, and image formation models, all of which have recently helped to close the gap with fully-supervised methods. We show that a surprisingly simple model, and associated design choices, lead to superior predictions. In particular, we propose (i) a minimum reprojection loss, designed to robustly handle occlusions, (ii) a full-resolution multi-scale sampling method that reduces visual artifacts, and (iii) an auto-masking loss to ignore training pixels that violate camera motion assumptions. We demonstrate the effectiveness of each component in isolation, and show high quality, state-of-the-art results on the KITTI benchmark.
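The minimum reprojection loss in (i) can be sketched compactly: rather than averaging the photometric error over all source frames warped into the target view, take the per-pixel minimum, so a pixel occluded in one source frame can still be explained by another. The sketch below uses a plain L1 photometric error for brevity (the paper combines L1 with SSIM), and the function name is illustrative:

```python
import numpy as np

def min_reprojection_loss(target, warped_sources):
    """Per-pixel minimum reprojection loss (cf. Monodepth2).

    target:         (H, W) or (H, W, C) target frame
    warped_sources: list of source frames warped into the target view

    Computes an L1 photometric error map per warped source, takes the
    per-pixel minimum across sources, and averages over the image.
    """
    def l1_error(warped):
        err = np.abs(target.astype(float) - warped.astype(float))
        # Average over channels if the frames are multi-channel.
        return err.mean(axis=-1) if target.ndim == 3 else err

    errors = [l1_error(w) for w in warped_sources]
    per_pixel_min = np.minimum.reduce(errors)  # (H, W)
    return per_pixel_min.mean()
```

With an average instead of a minimum, a pixel occluded in one source frame would contribute a large, irreducible error; the minimum lets the better-matching source frame explain it.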
This work addresses unsupervised learning of scene depth and robot ego-motion where supervision is provided by monocular videos, as cameras are the cheapest, least restrictive and most ubiquitous sensor for robotics.
It is shown that, surprisingly, state-of-the-art performance can be achieved by a simple baseline that does not attempt to model motion at all, and a simple and scalable RNN architecture is proposed that obtains state-of-the-art performance on human motion prediction.
This paper proposes the Double Sphere camera model, which well fits with large field-of-view lenses, is computationally inexpensive and has a closed-form inverse, and is evaluated using a calibration dataset with several different lenses.
This letter reconstructs a set of non-linear factors that optimally approximate the information on the trajectory accumulated by VIO, make the roll and pitch angles of the global map observable, and improve the robustness and accuracy of the mapping.
Extensive experiments on the KITTI VO dataset show competitive performance to state-of-the-art methods, verifying that the end-to-end Deep Learning technique can be a viable complement to the traditional VO systems.
Task-oriented flow (TOFlow), a motion representation learned in a self-supervised, task-specific manner, is proposed; it outperforms traditional optical flow on standard benchmarks as well as the Vimeo-90K dataset in three video processing tasks.
A state-of-the-art video denoising algorithm based on a convolutional neural network architecture is presented, exhibiting several desirable properties such as fast runtimes and the ability to handle a wide range of noise levels with a single network model.
This paper proposes a vision-based method for video sky replacement and harmonization, which can automatically generate realistic and dramatic sky backgrounds in videos with controllable styles and can be well applied to either online or offline processing scenarios.
This work introduces a new large-scale dataset for scene flow estimation derived from corresponding tracked 3D objects, which is 1,000 times larger than previous real-world datasets in terms of the number of annotated frames, and designs human-interpretable metrics that better capture real world aspects by accounting for ego-motion and providing breakdowns per object type.