3260 papers • 126 benchmarks • 313 datasets
Open World Object Detection is a computer vision problem where a model is tasked to: 1) identify objects that have not been introduced to it as `unknown', without explicit supervision to do so, and 2) incrementally learn these identified unknown categories without forgetting previously learned classes, when the corresponding labels are progressively received.
(Image credit: Papersgraph)
These leaderboards are used to track progress in open-world-object-detection
Use these libraries to find open-world-object-detection models and implementations
No subtasks available.
A classification-free Object Localization Network (OLN) is proposed which estimates the objectness of each region purely by how well the location and shape of a region overlap with any ground-truth object.
This work proposes a novel computer vision problem called ORE: Open World Object Detector, which is based on contrastive clustering and energy based unknown identification, and finds that identifying and characterising unknown instances helps to reduce confusion in an incremental object detection setting.
This paper advocates that existing methods lack a top-down supervision signal governed by human-understandable semantics and demonstrates that Multi-modal Vision Transformers (MViT) trained with aligned image-text pairs can effectively bridge this gap.
Hyp-OW is proposed, a method that learns and models hierarchical representation of known items through a SuperClass Regularizer, which allows it to effectively detect unknown objects using a similarity distance-based relabeling module.
This work proposes an open-vocabulary object detection method that, based on image-caption pairs, learns to detect novel object classes along with a given set of known classes and shows that a simple language model fits better than a large contextualized language model for detecting novel objects.
This paper proposes the down-weight loss function and decoupled detection structure, and leverages the VL as the ``Brain'' of the open-world detector by simply generating unknown labels, to learn novel objects beyond VL through the authors' pseudo-labeling scheme.
A novel probabilistic framework for objectness estimation is introduced, where it alternate between probability distribution estimation and objectness likelihood maximization of known objects in the embedded feature space - ultimately allowing to estimate the objectness probability of different proposals.
This paper proposes five fundamental benchmark principles in line with the OWOD definition and constructs two OWOD benchmarks according to the principles for a fair evaluation and introduces a novel and effective OWOD framework with an auxiliary Proposal ADvisor (PAD) and a Class-specific Expelling Classifier (CEC).
This work formulate the problem and devise a two-stage object detector to solve UC-OWOD, a novel OWOD problem called Unknown-Classified Open World Object Detection, which aims to detect unknown instances and classify them into different unknown classes.
This work introduces a novel end-to-end transformer-based framework, OW-DETR, for open-world object detection that explicitly encodes multi-scale contextual information, possesses less inductive bias, enables knowledge transfer from known classes to the unknown class and can better discriminate between unknown objects and background.
Adding a benchmark result helps the community track progress.