Home Research Papers Datasets State of the Art Pricing

Discover, visualize, and connect AI research papers. Explore the latest trends and insights in artificial intelligence research.

Product

Home
Research Papers
About

Support

Contact
Terms of Service
Privacy Policy

© 2026 Papersgraph. All rights reserved.

computer-vision

Object Detection In Indoor Scenes

3260 papers • 126 benchmarks • 313 datasets

Object detection in indoor scenes is the task of performing object detection within an indoor environment. ( Image credit: Faster Bounding Box Annotation for Object Detection in Indoor Scenes )

(Image credit: Papersgraph)

Benchmarks

These leaderboards are used to track progress in object-detection-in-indoor-scenes

Trend

Dataset

Best Model

Actions

SUN RGB-D

SUN RGB-D

Libraries

i

Use these libraries to find object-detection-in-indoor-scenes models and implementations

Datasets

SUN RGB-D

Kitchen Scenes

ISOD

Transparent Object Images | Indoor Object Dataset

Transparent Object Images | Indoor Object Dataset

Stairs Image Dataset | Parts of House | Indoor

Stairs Image Dataset | Parts of House | Indoor

Subtasks

No subtasks available.

Most implemented papers

Frustum PointNets for 3D Object Detection from RGB-D Data

C. Qi, L. Guibas, Hao Su, W. Liu, Chenxia Wu•Tue Nov 21 2017

This work directly operates on raw point clouds by popping up RGBD scans and leverages both mature 2D object detectors and advanced 3D deep learning for object localization, achieving efficiency as well as high recall for even small objects.

2466

Content

Introduction Benchmarks Datasets Subtasks Libraries Papers

Mobile Phone Dataset | Smartphone & Feature Phone

Mobile Phone Dataset | Smartphone & Feature Phone

Suitcase/Luggage Dataset Indoor Object Image

Suitcase/Luggage Dataset Indoor Object Image

Masks Dataset | Unattended Mask Images

Masks Dataset | Unattended Mask Images

Electronics Object Image Dataset | Computer Parts

Electronics Object Image Dataset | Computer Parts

Bottles and Cups Dataset | Household Objects

Bottles and Cups Dataset | Household Objects

0

simCrossTrans: A Simple Cross-Modality Transfer Learning for Object Detection with ConvNets or Vision Transformers

Xiaoke Shen, I. Stamos•Sat Mar 19 2022

The approach simCrossTrans is named: simple cross-modality transfer learning with ConvNets or ViTs, which surpasses the previous state-of-the-art (SOTA) by a large margin and is easy to implement and expand.

5 0

Learning Rich Features from RGB-D Images for Object Detection and Segmentation

Ross B. Girshick, Pablo Arbeláez, Jitendra Malik, Saurabh Gupta•Sun Jul 20 2014

A new geocentric embedding is proposed for depth images that encodes height above ground and angle with gravity for each pixel in addition to the horizontal disparity to facilitate the use of perception in fields like robotics.

1612 0

You Only Need One Detector: Unified Object Detector for Different Modalities based on Vision Transformers

Xiaoke Shen, I. Stamos, Zhujun Li, Jaime Canizales•Fri Dec 31 2021

It is pointed out that by using a vision transformer together with cross/inter modality transfer learning, a uniﬁed detector can achieve better performances when using diﬀerent modalities as inputs.

1 0

Adding a benchmark result helps the community track progress.

Object Detection In Indoor Scenes | State-of-the-Art