Panoptic Segmentation

Panoptic Segmentation is a computer vision task that combines semantic segmentation and instance segmentation to provide a comprehensive understanding of the scene. The goal of panoptic segmentation is to segment the image into semantically meaningful parts or regions, while also detecting and distinguishing individual instances of objects within those regions. In a given image, every pixel is assigned a semantic label, and pixels belonging to "things" classes (countable objects with instances, like cars and people) are assigned unique instance IDs. ( Image credit: Detectron2 )

Benchmarks

Libraries

Datasets

Subtasks

Most implemented papers

Mask R-CNN

Content

End-to-End Object Detection with Transformers

ResNeSt: Split-Attention Networks

Visual attention network

PVT v2: Improved baselines with Pyramid Vision Transformer

SOLOv2: Dynamic and Fast Instance Segmentation

Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation

Panoptic Segmentation

Panoptic Feature Pyramid Networks

Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation