3260 papers • 126 benchmarks • 313 datasets
Weakly Supervised Object Detection (WSOD) is the task of training object detectors with only image tag supervisions. ( Image credit: Soft Proposal Networks for Weakly Supervised Object Localization )
(Image credit: Papersgraph)
These leaderboards are used to track progress in weakly-supervised-object-detection
Use these libraries to find weakly-supervised-object-detection models and implementations
No subtasks available.
This paper proposes a weakly supervised deep detection architecture that modifies one such network to operate at the level of image regions, performing simultaneously region selection and classification.
This paper proposes a two-step progressive domain adaptation technique by fine-tuning the detector on two types of artificially and automatically generated samples, and achieves an improvement of approximately 5 to 20 percentage points in terms of mean average precision (mAP) compared to the best-performing baselines.
This work formulate weakly supervised object detection as a Multiple Instance Learning (MIL) problem, where instance classifiers (object detectors) are put into the network as hidden nodes and instance labels inferred from weak supervision are propagated to their spatially overlapped instances to refine instance classifier online.
This work proposes a method for the weakly supervised detection of objects in paintings, and introduces a new database, IconArt, on which it performs detection experiments on classes that could not be learned on photographs, such as Jesus Child or Saint Sebastian.
This work develops an instance-aware and context-focused unified framework for weakly supervised video object detection that achieves state-of-the-art results on COCO, VOC 2007, and VOC 2012 while devising a memory-efficient sequential batch back-propagation.
This work shows how self-supervised learning, based on a teacher-student network with a modified student network update design, can be used to build face and body detectors and demonstrates that style transfer can be incorporated into the learning pipeline to bootstrap detectors using a vast amount of out-of-domain labeled images from natural images.
This paper is the first showing that a self-paced approach can be used with deep-network-based classifiers in an end-to-end training pipeline, built on the fully-supervised Fast-RCNN architecture and can be applied to similar architectures which represent the input image as a bag of boxes.
Despite its simplicity, the proposed weakly supervised object detection method shows competitive results on a range of publicly available datasets, including paintings, watercolors, cliparts and comics and allows to quickly learn unseen visual categories.
This paper first shows that instances can be assigned object or background labels directly based on proposal clusters for instance classifier refinement, and then shows that treating each cluster as a small new bag yields fewer ambiguities than the directly assigning label method.
Adding a benchmark result helps the community track progress.