3260 papers • 126 benchmarks • 313 datasets
Object proposal generation is a preprocessing technique that has been widely used in current object detection pipelines to guide the search of objects and avoid exhaustive sliding window search across images. ( Image credit: Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation )
(Image credit: Papersgraph)
These leaderboards are used to track progress in object-proposal-generation
Use these libraries to find object-proposal-generation models and implementations
No subtasks available.
Extensive experiments on the 3D detection benchmark of KITTI dataset show that the proposed architecture outperforms state-of-the-art methods with remarkable margins by using only point cloud as input.
This work proposes a novel end-to-end deep semantic edge learning architecture based on ResNet and a new skip-layer architecture where category-wise edge activations at the top convolution layer share and are fused with the same set of bottom layer features.
This paper proposes Multi-View 3D networks (MV3D), a sensory-fusion framework that takes both LIDAR point cloud and RGB images as input and predicts oriented 3D bounding boxes and designs a deep fusion scheme to combine region-wise features from multiple views and enable interactions between intermediate layers of different paths.
A differentiable, end-to-end trainable framework for solving pixel-level grouping problems such as instance segmentation consisting of two novel components, implementing a variant of mean-shift clustering as a recurrent neural network parameterized by kernel bandwidth.
This paper proposes a unified approach for bottom-up hierarchical image segmentation and object proposal generation for recognition, called Multiscale Combinatorial Grouping (MCG), and develops a fast normalized cuts algorithm and proposes a high-performance hierarchical segmenter that makes effective use of multiscale information.
The selective convolutional descriptor aggregation (SCDA) method is proposed, which is unsupervised, using no image label or bounding box annotation, and achieves comparable retrieval results with the state-of-the-art general image retrieval approaches.
A new method for semantic instance segmentation is proposed, by first computing how likely two pixels are to belong to the same object, and then by grouping similar pixels together, based on a deep, fully convolutional embedding model.
This paper proposes object proposal generation based on non-parametric Bayesian inference that allows quantification of the likelihood of the proposals, and applies Markov chain Monte Carlo to draw samples of image segmentations via the distance dependent Chinese restaurant process.
It is shown that the proposed modification of the post-processing phase that uses high-scoring object detections from nearby frames to boost scores of weaker detections within the same clip obtains superior results to state-of-the-art single image object detection techniques.
Convolutional Channel Features (CCF) serves as a good way of tailoring pre-trained CNN models to diverse tasks without fine-tuning the whole network to each task by achieving state-of-the-art performances in pedestrian detection, face detection, edge detection and object proposal generation.
Adding a benchmark result helps the community track progress.