3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in fine-grained-visual-categorization-11
No benchmarks available.
Use these libraries to find fine-grained-visual-categorization-11 models and implementations
No subtasks available.
Through the comprehensive experiments demonstrate that the multi-branch and multi-scale learning network, MMAL-Net, has good classification ability and robustness for images of different scales and can achieves state-of-the-art results on CUB-200-2011, FGVC-Aircraft and Stanford Cars datasets.
A subset, expert-annotated to create a pilot dataset for apple scab, cedar apple rust, and healthy leaves, was made available to the Kaggle community for 'Plant Pathology Challenge'; part of the Fine-Grained Visual Categorization workshop at CVPR 2020.
This work proposes a novel Attribute-Mask RCNN model to jointly perform instance segmentation and localized attribute recognition, and provides a novel evaluation metric for the task.
A new deep FGVC model termed MetaFGNet is proposed, based on a novel regularized meta-learning objective, which aims to guide the learning of network parameters so that they are optimal for adapting to the target FGVC task.
This work proposes a novel classification-specific part estimation that uses an initial prediction as well as back-propagation of feature importance via gradient computations in order to estimate relevant image regions and shows the effectiveness of the mentioned part selection method in conjunction with the extracted part features.
A novel dataset FeatherV1, containing 28,272 images of feathers categorized by 595 bird species, was created to perform taxonomic identification of bird species by a single feather, which can be applied in amateur and professional ornithology.
An attention convolutional binary neural tree architecture is presented to address problems for weakly supervised Fine-grained visual categorization and uses the attention transformer module to enforce the network to capture discriminative features.
Adding a benchmark result helps the community track progress.