
Sample Efficient Grasp Learning Using Equivariant Models

Abstract

In planar grasp detection, the goal is to learn a function from an image of a scene onto a set of feasible grasp poses in SE(2). In this paper, we recognize that the optimal grasp function is SE(2)-equivariant and can be modeled using an equivariant convolutional neural network. As a result, we are able to significantly improve the sample efficiency of grasp learning, obtaining a good approximation of the grasp function after only 600 grasp attempts. This is few enough that we can learn to grasp completely on a physical robot in about 1.5 hours.
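Concretely (writing f for the grasp function and I for the input image, notation not used in the original README), SE(2)-equivariance means that rotating or translating the scene transforms the optimal grasps in the same way: f(g · I) = g · f(I) for every g in SE(2). This is exactly the structure the equivariant convolutional network is built to respect.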

Paper     Website     Video

Minimum Code*     Complete Code*    

*Note that the minimum code includes only our method, while the complete code also includes all the baselines from our paper.

Citation

@article{zhu2022grasp,
  title={Sample Efficient Grasp Learning Using Equivariant Models},
  author={Zhu, Xupeng and Wang, Dian and Biza, Ondrej and Su, Guanang and Walters, Robin and Platt, Robert},
  journal={Proceedings of Robotics: Science and Systems (RSS)},
  year={2022} }

Environments

Simulation Environment

The simulation environment is random_household_picking_clutter_full_obs_30. This environment is implemented in /helping_hands_rl_envs/envs/pybullet_envs.

Physical Environment

The physical robot environment is DualBinFrontRear. To train in this environment, a physical robot setup is required.

Installation

You can install the required packages either through Option 1 (Anaconda) or Option 2 (pip).

Option 1: Anaconda

  1. Install Anaconda
  2. Create and activate a conda virtual environment with Python 3.7.
    sudo apt update
    conda create -n eqvar_grasp python=3.7
    conda activate eqvar_grasp
    
  3. Download the git repository.
    git clone https://github.com/ZXP-S-works/SE2-equivariant-grasp-learning.git
    cd SE2-equivariant-grasp-learning
    
  4. Install PyTorch (Recommended: pytorch==1.8.1, torchvision==0.9.1; an example install command is given after this list)
  5. Install CuPy
    conda install -c conda-forge cupy
    
  6. Install the other required packages
    pip install -r requirements.txt
    
  7. Clone and install the environment repo
    git clone https://github.com/ColinKohler/helping_hands_rl_envs.git -b xupeng_realistic
    cd helping_hands_rl_envs
    pip install -r requirements.txt
    cd ..
    
  8. Run the experiments below from the root folder of this repo (after the cd .. above, you are already back in SE2-equivariant-grasp-learning).
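For step 4, a command along the following lines should work (this exact command is not part of the original instructions; adjust the cudatoolkit version to match your CUDA driver):

    conda install pytorch==1.8.1 torchvision==0.9.1 cudatoolkit=10.2 -c pytorch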
    

Option 2: pip

  1. Install Python 3.7
  2. Download the git repository.
    git clone https://github.com/ZXP-S-works/SE2-equivariant-grasp-learning.git
    cd SE2-equivariant-grasp-learning
    
  3. Install PyTorch (Recommended: pytorch==1.8.1, torchvision==0.9.1; example install commands are given after this list)
  4. Install CuPy (see the example commands after this list)
  5. Install the other required packages
    pip install -r requirements.txt
    
  6. Clone and install the environment repo
    git clone https://github.com/ColinKohler/helping_hands_rl_envs.git -b xupeng_realistic
    cd helping_hands_rl_envs
    pip install -r requirements.txt
    cd ..
    
  7. Run the experiments below from the root folder of this repo (after the cd .. above, you are already back in SE2-equivariant-grasp-learning).
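For steps 3 and 4, commands along the following lines should work (these exact commands are not part of the original instructions; pick the CuPy wheel that matches your CUDA version, e.g. cupy-cuda111 for CUDA 11.1):

    pip install torch==1.8.1 torchvision==0.9.1
    pip install cupy-cuda102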
    

Reinforcement learning

Training baselines in simulation

Our method

python3 ./scripts/main.py 

To visualize the simulation and the policy learning, set --render=t.
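For example, to train our method while watching the simulation:

python3 ./scripts/main.py --render=t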

To load the trained model and visualize the learned policy, you can run the following code:

python3 ./scripts/main.py \
--log_pre="PATH_TO_SAVE_THE_LOG" \
--step_eps=0 \
--init_eps=0 \
--render=t \
--train_tau=0.002 \
--training_offset=10000 \
--load_model_pre="PATH_TO_THE_MODEL" \
--ADDITIONAL_PARAMETERS_FOR_YOUR_MODEL

Where the "PATH_TO_THE_MODEL" is the path to the trained model, without _qx.pt. For example --load_model_pre="/results/household_repo/snapshot_random_household_picking_clutter_full_obs".

In addition, ADDITIONAL_PARAMETERS_FOR_YOUR_MODEL should be set to match the trained model, for example --model, --alg, and --action_selection.
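Putting this together with the example path above, a full invocation might look like the following (the --model, --alg, and --action_selection values are placeholders; substitute the settings the model was trained with):

python3 ./scripts/main.py \
--log_pre="PATH_TO_SAVE_THE_LOG" \
--step_eps=0 --init_eps=0 --render=t --train_tau=0.002 --training_offset=10000 \
--load_model_pre="/results/household_repo/snapshot_random_household_picking_clutter_full_obs" \
--model=YOUR_MODEL --alg=YOUR_ALG --action_selection=YOUR_ACTION_SELECTION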

Real-time training in a physical robot

Parallel training is currently implemented only for the physical robot environment (the code for this environment is coming soon), but it can easily be adapted to other environments.

python3 ./scripts/train_robot_parallel.py --env=DualBinFrontRear --hm_threshold=0.015 --step_eps=20 --init_eps=1. --final_eps=0.
