MGRNet: Multiple geometry representations for 6D object pose estimation in occluded or truncated scenes

This repository contains the authors' implementation of MGRNet: 6D Object Pose Estimation under Multiple Geometry Representations. Our implementation is based on FFB6D and HybridPose. We welcome discussion of the implementation and the paper; please feel free to open an issue.

Introduction

Deep learning-based methods for 6D object pose estimation from a single RGB-D image have recently received increasing attention because of their powerful representation learning capabilities. These methods, however, cannot handle severe occlusion and truncation. In this paper, we present a novel 6D object pose estimation method based on multiple geometry representations. Specifically, we introduce a network to fuse the appearance and geometry features extracted from the input color and depth images. Then, we use these per-point fusion features to estimate keypoint offsets, edge vectors, and dense symmetry correspondences in the canonical coordinate system. Finally, a two-stage pose regression module computes the 6D pose of the object. Compared with a strategy based on 3D keypoints alone, this combination of multiple geometry representations provides sufficient and diverse information, especially for occluded or truncated scenes.
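
To make the three geometry representations concrete, here is a minimal sketch in plain Python. It is not the code in this repository. It assumes that each scene point votes for a keypoint by adding its predicted offset to its own position, that the votes are aggregated with a simple mean (the aggregation actually used by MGRNet is not described here), that an edge vector is the difference between two keypoints, and that a symmetry correspondence is the mirror image of a point across a plane through the origin. All function names are hypothetical.

```python
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


def vote_keypoint(points: Sequence[Vec3], offsets: Sequence[Vec3]) -> Vec3:
    """Average the per-point votes (point + predicted offset) for one keypoint."""
    if not points or len(points) != len(offsets):
        raise ValueError("points and offsets must be non-empty and equally long")
    n = len(points)
    sums = [0.0, 0.0, 0.0]
    for p, o in zip(points, offsets):
        for axis in range(3):
            sums[axis] += p[axis] + o[axis]
    return (sums[0] / n, sums[1] / n, sums[2] / n)


def edge_vector(kp_a: Vec3, kp_b: Vec3) -> Vec3:
    """Vector from keypoint a to keypoint b (the sign convention is an assumption)."""
    return (kp_b[0] - kp_a[0], kp_b[1] - kp_a[1], kp_b[2] - kp_a[2])


def reflect(point: Vec3, unit_normal: Vec3) -> Vec3:
    """Mirror a point across the plane through the origin with the given unit normal."""
    d = sum(p * n for p, n in zip(point, unit_normal))
    return tuple(p - 2.0 * d * n for p, n in zip(point, unit_normal))
```

The intuition behind combining the three is that when occlusion hides the points that would vote for one keypoint, the edge vectors and symmetry correspondences predicted from the visible points still constrain the pose.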

Installation

Set up the Python environment:

```
conda create -n mgrnet python=3.7
conda activate mgrnet

# install torch 1.8.1 built with CUDA 11.1
pip install torch==1.8.1+cu111 torchvision==0.9.1+cu111 -f https://download.pytorch.org/whl/torch_stable.html

pip install -r requirements.txt
```
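
A quick way to confirm that the pinned build was actually installed is to inspect the version string that torch reports. The helper below is a small sketch with hypothetical function names; it only splits and compares the string and does not import torch itself.

```python
from typing import Tuple


def split_torch_version(version: str) -> Tuple[str, str]:
    """Split a string such as '1.8.1+cu111' into ('1.8.1', 'cu111').

    The build tag is empty when the version has no '+' suffix.
    """
    release, _, build = version.partition("+")
    return release, build


def matches_readme(version: str) -> bool:
    """True if the version is the torch 1.8.1 / CUDA 11.1 build pinned above."""
    return split_torch_version(version) == ("1.8.1", "cu111")
```

Inside the activated mgrnet environment, run `import torch` and then `matches_readme(torch.__version__)`. `torch.cuda.is_available()` additionally reports whether the GPU is visible to that build.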

Install apex:

```
git clone https://github.com/NVIDIA/apex
cd apex
export TORCH_CUDA_ARCH_LIST="6.0;6.1;6.2;7.0;7.5"  # set the target architecture manually, suggested in issue https://github.com/NVIDIA/apex/issues/605#issuecomment-554453001
pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./
cd ..
```
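
The architecture list in the apex step is given manually. If your GPU is not covered by that list, its value can be derived from the compute capability of the installed devices. The following is a sketch with a hypothetical helper name; it only formats (major, minor) pairs into the semicolon-separated form that TORCH_CUDA_ARCH_LIST expects.

```python
from typing import Iterable, Tuple


def arch_list(capabilities: Iterable[Tuple[int, int]]) -> str:
    """Format (major, minor) compute capabilities as 'M.m;M.m', sorted and de-duplicated."""
    unique = sorted(set(capabilities))
    return ";".join(f"{major}.{minor}" for major, minor in unique)
```

With torch installed, the pairs can be collected by calling `torch.cuda.get_device_capability(i)` for each `i` in `range(torch.cuda.device_count())`, and the resulting string exported as TORCH_CUDA_ARCH_LIST before building apex.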

Install normalSpeed, a fast and light-weight normal map estimator:

```
git clone https://github.com/hfutcgncas/normalSpeed.git
cd normalSpeed/normalSpeed
python setup.py install --user
cd ../..
```

Install tkinter:
```
sudo apt-get install python3-tk
```
Compile the RandLA-Net operators:
```
cd models/RandLA/
sh compile_op.sh
cd ../..
```
Compile the pose regressor:
```
cd lib/regressor
make
cd ../..
```

Dataset set-up

LineMOD: Download the preprocessed LineMOD dataset from the OneDrive link or the Google Drive link. Unzip it and link the unzipped Linemod_preprocessed/ to datasets/linemod/Linemod_preprocessed:
```
ln -s path_to_unzipped_Linemod_preprocessed datasets/linemod/
```
Generate rendered and fused data following raster_triangle.
Download symmetries and generate sym_cor labels with label_real.py, label_render.py and label_fuse.py for each object.
Occlusion LineMOD: Download the Occlusion LineMOD dataset from Google Drive or Tencent Weiyun. Unzip it and link the unzipped OcclusionChallengeICCV2015 to datasets/linemod/Occlusion_Linemod:
```
ln -s path_to_unzipped_Occlusion_Linemod datasets/linemod/Occlusion_Linemod
```
Truncation LineMOD: Download our regenerated Truncation LineMOD dataset from Google Drive.
BOP LineMOD: Download the BOP version of the LineMOD dataset from here. The required archives are lm_train_pbr.zip, lm_models.zip and lmo_test_all.zip. Unzip them and link them into datasets/lmo/:
```
ln -s path_to_unzipped_train_pbr datasets/lmo/train_pbr
ln -s path_to_unzipped_models datasets/lmo/models_eval
ln -s path_to_unzipped_lmo_test datasets/lmo/test
```
Download symmetries and generate sym_cor labels with label_sym.py for all training and testing images.
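
Because the links above are created by hand, it can help to verify the layout before training. The sketch below is a hypothetical helper, not part of this repository. It assumes the datasets/ folder name used in the prose above and checks only the link targets named there; the location of the Truncation LineMOD data is not specified, so it is not checked.

```python
import os
from typing import List

# Paths named in the dataset instructions, relative to the repository root.
EXPECTED_DIRS = [
    "datasets/linemod/Linemod_preprocessed",
    "datasets/linemod/Occlusion_Linemod",
    "datasets/lmo/train_pbr",
    "datasets/lmo/models_eval",
    "datasets/lmo/test",
]


def missing_dirs(repo_root: str, expected: List[str] = EXPECTED_DIRS) -> List[str]:
    """Return the expected paths that do not resolve to a directory.

    os.path.isdir follows symlinks, so a dangling link is reported as missing.
    """
    return [rel for rel in expected if not os.path.isdir(os.path.join(repo_root, rel))]
```

Calling `missing_dirs(".")` from the repository root returns an empty list once every link resolves.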

Training and evaluating

Training on the LineMOD dataset

A separate model is trained for each target object. Set the variable cls in train_lm.sh to the object name (for example, 'ape'), then run the script:
```
bash train_lm.sh
```
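
Editing the script by hand for every object is repetitive. The sketch below shows one way to automate the edit. It assumes, without having inspected the script, that train_lm.sh contains a line beginning with `cls=`; if the variable is defined differently, the pattern has to be adapted. The function name is hypothetical.

```python
import re


def set_cls(script_text: str, cls: str) -> str:
    """Replace the first line of the form cls=... with cls='<name>'."""
    new_text, count = re.subn(
        r"^cls=.*$",
        lambda match: f"cls='{cls}'",  # a callable avoids backslash handling in the name
        script_text,
        count=1,
        flags=re.MULTILINE,
    )
    if count == 0:
        raise ValueError("no line starting with 'cls=' found")
    return new_text
```

A driver loop could read train_lm.sh, call `set_cls` for each object name, write the result to a temporary script, and run it with bash.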

Evaluating on the Occlusion and Truncation LineMOD datasets

Before evaluating an object, set cls and tst_mdl in test_lm.sh, then run the script:
```
bash test_lm.sh
```
The variable tst_mdl should be the path to a pretrained model. Models trained with train_lm.sh are stored in train_log/linemod/checkpoints/{cls}.
Pretrained model: We provide our pretrained models for each object on link.

Training on the BOP LineMOD dataset

Because every training image of the BOP LineMOD dataset contains labels for every object instance, we only need to train one model for all 15 objects. Run the following command to start training:
```
bash train_boplm.sh
```

Evaluating on the BOP LM-O dataset

Start evaluating by:
```
bash test_boplm.sh
```
Pretrained model: We provide our pre-trained models on link.
