End-to-end grasping policies for human-in-the-loop robots via deep reinforcement learning* (2021-04-26T00:00:00.000000Z)

TL;DR

A method for end-to-end training of a policy for human-in-the-loop robot grasping on real reaching trajectories using Reinforcement Learning and Imitation Learning in DEXTRON, a stochastic simulation environment with real human trajectories that are augmented and selected using a Monte Carlo simulation method.

Abstract

State-of-the-art human-in-the-loop robot grasping is hugely suffered by Electromyography (EMG) inference robustness issues. As a workaround, researchers have been looking into integrating EMG with other signals, often in an ad hoc manner. In this paper, we are presenting a method for end-to-end training of a policy for human-in-the-loop robot grasping on real reaching trajectories. For this purpose we use Reinforcement Learning (RL) and Imitation Learning (IL) in DEXTRON (DEXTerity enviRONment), a stochastic simulation environment with real human trajectories that are augmented and selected using a Monte Carlo (MC) simulation method. We also offer a success model which once trained on the expert policy data and the RL policy roll-out transitions, can provide transparency to how the deep policy works and when it is probably going to fail.

Authors

M. Sharif

1 papers

D. Erdoğmuş

1 papers

Chris Amato

1 papers

TL;DR

Abstract

Authors

References40 items

Towards End-to-End Control of a Robot Prosthetic Hand via Reinforcement Learning

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Particle Filters vs Hidden Markov Models for Prosthetic Robot Hand Grasp Selection

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Shared human–robot proportional control of a dexterous myoelectric prosthesis

Vision-based grasp learning of an anthropomorphic hand-arm system in a synergy-based control framework

Human Adaptation to Human–Robot Shared Control

HTC Vive: Analysis and Accuracy Improvement

SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning

Can We Achieve Intuitive Prosthetic Elbow Control Based on Healthy Upper Limb Motor Strategies?

Reinforcement Learning from Imperfect Demonstrations

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

DeepMind Control Suite

Real-time robustness evaluation of regression based myoelectric control against arm position change and donning/doffing

Visual Cues to Improve Myoelectric Control of Upper Limb Prostheses

Learning from demonstration: Teaching a myoelectric prosthesis with an intact limb via reinforcement learning

Domain randomization for transferring deep neural networks from simulation to the real world

Human-Robot Mutual Adaptation in Shared Autonomy

(CAD)$^2$RL: Real Single-Image Flight without a Single Real Image

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates

The Reality of Myoelectric Prostheses: Understanding What Makes These Devices Difficult for Some Users to Control

Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection

MuJoCo HAPTIX: A virtual reality system for hand manipulation

Current state of digital signal processing in myoelectric interfaces and related applications

Adam: A Method for Stochastic Optimization

Extracting Signals Robust to Electrode Number and Shift for Online Simultaneous and Proportional Myoelectric Control by Factorization Algorithms

MuJoCo: A physics engine for model-based control

Online human training of a myoelectric prosthesis controller via actor-critic reinforcement learning

AprilTag: A robust and flexible visual fiducial system

Determining the Optimal Window Length for Pattern Recognition-Based Myoelectric Control: Balancing the Competing Effects of Classification Error and Controller Delay

A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning

Cognitive vision system for control of dexterous prosthetic hands: Experimental evaluation

Movement characteristics of upper extremity prostheses during basic goal-directed tasks.

Gradual molding of the hand to object contours.

Collaborative Plans for Complex Group Action

Models of Trajectory Formation and Temporal Interaction of Reach and Grasp.

Digideep: A DeepRL pipeline for developers

The role of trust in human-robot interaction

Bebionic Prosthetic Design

Quaternions, Interpolation and Animation

Field of Study

Journal Information

Name

Page

Venue Information

Name

Type

URL

Alternate Names