3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in monocular-visual-odometry-10
No benchmarks available.
Use these libraries to find monocular-visual-odometry-10 models and implementations
No subtasks available.
Extensive experiments on the KITTI VO dataset show competitive performance to state-of-the-art methods, verifying that the end-to-end Deep Learning technique can be a viable complement to the traditional VO systems.
This work revisit the basics of VO and explore the right way for integrating deep learning with epipolar geometry and Perspective-n-Point method and design a simple but robust frame-to-frame VO algorithm (DF-VO) which outperforms pure deep learning-based and geometry-based methods.
This work proposes a method to carefully sample high-quality correspondences from deep flows and recover accurate camera poses with a geometric module, and addresses the scale-drift issue by aligning geometrically triangulated depths to thescale-consistent deep depths, where the dynamic scenes are taken into account.
A monocular depth estimation method SC-Depth, which requires only unlabelled videos for training and enables the scale-consistent prediction at inference time and the proposed hybrid Pseudo-RGBD SLAM shows compelling results in KITTI, and it generalizes well to the KAIST dataset without additional training.
A learning based approach to predict camera motion parameters directly from optic flow, by marginalizing depthmap variations and outliers by learning a sparse overcomplete basis set of egomotion in an autoencoder network.
The proposed method uses an additional camera to accurately estimate and optimize the scale of the monocular visual odometry, rather than triangulating 3D points from stereo matching, and it is computationally efficient, adding minimal overhead to the stereo vision system compared to straightforward stereo matching.
A comprehensive endoscopic SLAM dataset consisting of 3D point cloud data for six porcine organs, capsule and standard endoscopy recordings as well as synthetically generated data is introduced and Endo-SfMLearner, an unsupervised monocular depth and pose estimation method that combines residual networks with spatial attention module is propound.
This work presents WGANVO, a Deep Learning based monocular Visual Odometry method, where a neural network is trained to regress a pose estimate from an image pair using a semi-supervised approach.
This paper presents a 6-DoF monocular visual odometry that initializes instantly and without motion parallax, and shows that the proposed pose estimator outperforms the classical approaches for 6- doF pose estimation used in the literature in low-parallax configurations.
Adding a benchmark result helps the community track progress.