3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in viewpoint-estimation
No benchmarks available.
Use these libraries to find viewpoint-estimation models and implementations
No subtasks available.
A scalable and overfit-resistant image synthesis pipeline, together with a novel CNN specifically tailored for the viewpoint estimation task, is proposed that can significantly outperform state-of-the-art methods on PASCAL 3D+ benchmark.
This work proposes an iterative Semantic Pose Alignment Network, called iSPA-Net, which focuses on exploiting semantic 3D structural regularity to solve the task of fine-grained pose estimation by predicting viewpoint difference between a given pair of images.
Although conceptually simple, this method outperforms more complex and computationally expensive approaches that leverage semantic segmentation, instance level segmentation and flat ground priors and produces state of the art results for 3D viewpoint estimation on the Pascal 3D+ dataset.
By introducing a spherical exponential mapping on n-spheres at the regression output, this work obtains well-behaved gradients, leading to stable training and shows how the spherical regression can be utilized for several computer vision challenges, specifically viewpoint estimation, surface normal estimation and 3D rotation estimation.
It is shown that increasing the number of in-distribution combinations substantially improves generalization to OOD combinations, even with the same amount of training data, and it is demonstrated that such OOD generalization is facilitated by the neural mechanism of specialization.
This paper tackles the problems of few-shot object detection and few- shot viewpoint estimation, and introduces a simple category-agnostic viewpoint estimation method that outperforms state-of-the-art methods by a large margin on a range of datasets.
This work proposes a completely generic deep pose estimation approach, which does not require the network to have been trained on relevant categories, nor objects in a category to have a canonical pose, and demonstrates that this method boosts performances for supervised category pose estimation on standard benchmarks.
This work forms a solution to the adviser problem using a deep network and applies it to the viewpoint estimation problem where the question asks for the location of a specific keypoint in the input image, and is able to outperform the previous hybrid-intelligence state-of-the-art.
This work proposes a data-efficient method which utilizes the geometric regularity of intraclass objects for pose estimation, which enables this method to achieve state-of-the-art performance on multiple real-image viewpoint estimation datasets, such as Pascal3D+ and ObjectNet3D.
This work proposes a novel learning framework which incorporates an analysis-by-synthesis paradigm to reconstruct images in a viewpoint aware manner with a generative network, along with symmetry and adversarial constraints to successfully supervise the authors' viewpoint estimation network.
Adding a benchmark result helps the community track progress.