We propose a novel benchmark for object group distribution shifts in hand and object pose regression for object grasping, and test the hypothesis that meta-learning a baseline pose regression neural network improves adaptation to these shifts and generalization to unknown objects.
Understanding hand-object pose with computer vision opens the door to new applications in mixed reality, assisted living, and human-robot interaction. Most methods, however, are trained and evaluated on balanced datasets, which is of limited use in real-world applications: how do these methods perform in the wild, on unknown objects? We propose a novel benchmark for object group distribution shifts in hand and object pose regression. We then test the hypothesis that meta-learning a baseline pose regression neural network can adapt to these shifts and generalize better to unknown objects. Our results show measurable improvements over the baseline, depending on the amount of prior knowledge. For the task of joint hand-object pose regression, we observe optimization interference for the meta-learner. To address this issue and improve the method further, we provide a comprehensive analysis which should serve as a basis for future work on this benchmark.

Test-time adaptation (TTA) methods have demonstrated robustness to distribution shifts in-the-wild in hand and object segmentation. We propose to achieve this on the grasp prediction problem with a meta-learning algorithm, where the model learns to quickly adapt to new tasks from a few examples at test time [17, 24]. We then evaluate this method on a novel benchmark for group distribution shifts in hand-object pose regression for object grasping. How does the performance of a CNN pose predictor evolve as the test-set grasps diverge from the training set? We answer this question and examine the advantages and limitations of meta-learning for this application via experiments and empirical analysis.
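To make the "learn to quickly adapt from a few examples at test time" idea concrete, the following is a minimal first-order MAML-style sketch on a toy 1-D regression problem. It is illustrative only: the toy task family, the scalar model, the learning rates, and all variable names are our own assumptions, not the actual pose regression architecture or benchmark used in this work.

```python
import numpy as np

rng = np.random.default_rng(0)

def loss_grad(w, x, y):
    # MSE loss 0.5*mean((w*x - y)^2) and its gradient w.r.t. the scalar weight w
    err = w * x - y
    return 0.5 * np.mean(err ** 2), np.mean(err * x)

def adapt(w, x, y, inner_lr=0.5, steps=5):
    # Inner loop: a few gradient steps on the task's small support set
    for _ in range(steps):
        _, g = loss_grad(w, x, y)
        w = w - inner_lr * g
    return w

# Meta-training over a distribution of toy tasks y = a*x (first-order MAML:
# the outer update uses the gradient at the adapted weights directly)
w_meta = 0.0
for _ in range(500):
    a = rng.uniform(1.0, 3.0)                       # task parameter
    x_s = rng.uniform(-1, 1, 10); y_s = a * x_s     # support set
    x_q = rng.uniform(-1, 1, 10); y_q = a * x_q     # query set
    w_task = adapt(w_meta, x_s, y_s)                # inner-loop adaptation
    _, g_q = loss_grad(w_task, x_q, y_q)            # query loss gradient
    w_meta -= 0.01 * g_q                            # outer (meta) update

# At test time, an unseen task is adapted from w_meta with only a few examples
a_new = 2.5
x_new = rng.uniform(-1, 1, 5); y_new = a_new * x_new
w_adapted = adapt(w_meta, x_new, y_new)
```

The same two-loop structure carries over to the pose regression setting: the inner loop adapts the network to a small support set from one object group, and the outer loop optimizes the initial weights so that this few-step adaptation generalizes to held-out query samples.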