3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in foreground-segmentation-1
No benchmarks available.
Use these libraries to find foreground-segmentation-1 models and implementations
No datasets available.
No subtasks available.
One-Shot Video Object Segmentation (OSVOS), based on a fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned on ImageNet, to the task of foreground segmentation, and finally to learning the appearance of a single annotated object of the test sequence (hence one-shot).
A neural head reenactment system, which is driven by a latent pose representation and is capable of predicting the foreground segmentation alongside the RGB image, and shows that the learned descriptors are useful for other pose-related tasks, such as keypoint prediction and pose-based retrieval.
This paper takes inspiration from the widely-used pre-training and then prompt tuning protocols in NLP and proposes a new visual prompting model, named Explicit Visual Prompting (EVP), which freezes a pre-trained model and then learns task-specific knowledge using a few extra parameters.
This work introduces PartSTAD, a method designed for the task adaptation of 2D-to-3D segmentation lifting that finetunes a 2D bounding box prediction model with an objective function for 3D segmentation.
Two robust encoder-decoder type neural networks that generate multi-scale feature encodings in different ways and can be trained end-to-end using only a few training samples are proposed.
An improved model that attends to multiple candidate locations, generates segmentation proposals to mask out background clutter and selects among the segmented objects is introduced, suggesting that such image recognition models based on an iterative refinement of object detection and foreground segmentation may provide a way to deal with highly cluttered scenes.
This work proposes a variation of the formerly proposed FgSegNet method that can be trained end-to-end using only a few training examples, and outperforms all existing state-of-the-art methods in CDnet2014 datasets by an average overall F-measure.
A new method to simultaneously tackle multispectral segmentation and stereo registration is presented, based on the alternating minimization of two energy functions that are linked through the use of dynamic priors.
This method performs data augmentation that not only creates endless data on the fly, but also features semantic transformations of illumination which enhance the generalisation of the model.
Adding a benchmark result helps the community track progress.