Real-Time Object Detection

Real-Time Object Detection is a computer vision task that involves identifying and locating objects of interest in real-time video sequences with fast inference while maintaining a base level of accuracy. This is typically solved using algorithms that combine object detection and tracking techniques to accurately detect and track objects in real-time. They use a combination of feature extraction, object proposal generation, and classification to detect and localize objects of interest. ( Image credit: CenterNet )

Benchmarks

Libraries

Datasets

Subtasks

Most implemented papers

YOLOv3: An Incremental Improvement

Content

YOLO9000: Better, Faster, Stronger

YOLOv4: Optimal Speed and Accuracy of Object Detection

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Mask R-CNN

You Only Look Once: Unified, Real-Time Object Detection

CSPNet: A New Backbone that can Enhance Learning Capability of CNN

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Objects as Points