H3WB: Human3.6M 3D WholeBody Dataset and Benchmark

This is the official repository for the paper "H3WB: Human3.6M 3D WholeBody Dataset and Benchmark". The repo contains Human3.6M 3D WholeBody (H3WB) annotations proposed in this paper.

For the 3D whole-body benchmark and results please refer to benchmark.md.

What is H3WB

H3WB is a large-scale dataset for 3D whole-body pose estimation. It is an extension of Human3.6m dataset and contains 133 whole-body (17 for body, 6 for feet, 68 for face and 42 for hands) keypoint annotations on 100K images. The skeleton layout is the same as COCO-Wholebody dataset.

Example annotations:

Layout from COCO-WholeBody: Image source.

H3WB Dataset

Download

Images can be downloaded from the official cite of Human3.6m dataset. We provide a data preparation script to compile Human3.6m videos into images which allows establishing correct correspondence between images and annotations.

The annotations can be downloaded from here and by default it is put under datasets/json/.

Annotation format

Every json is in the following structure, but not every json contains all these values. See Tasks section.

XXX.json --- sample id --- 'image_path'
                        |
                        -- 'bbox' --- 'x_min'
                        |          |- 'y_min'
                        |          |- 'x_max'
                        |          |- 'y_max'
                        |
                        |- 'keypont_2d' --- joint id --- 'x'
                        |                             |- 'y'
                        |
                        |- 'keypont_3d' --- joint id --- 'x'
                                                      |- 'y'
                                                      |- 'z'

We also provide a script to load json files.

Tasks

We propose 3 different tasks along with the 3D WholeBody dataset:

2D → 3D: 2D complete whole-body to 3D complete whole-body lifting

Use 2Dto3D_train.json for training and validation. It contains 80k 2D and 3D keypoints.
Use 2Dto3D_test_2d.json for test on leaderboard. It contains 10k 2D keypoints.

I2D → 3D: 2D incomplete whole-body to 3D complete whole-body lifting

Use 2Dto3D_train.json for training and validation. It contains 80k 2D and 3D keypoints.
Please apply masking on yourself during the training. The official masking strategy is as follows:
- With 40% probability, each keypoint has a 25% chance of being masked,
- with 20% probability, the face is entirely masked,
- with 20% probability, the left hand is entirely masked,
- with 20% probability, the right hand is entirely masked.
Use I2Dto3D_test_2d.json for test on leaderboard. It contains 10k 2D keypoints. Note that this test set is different from the 2Dto3D_test_2d.json.

RGB → 3D: Image to 3D complete whole-body prediction

Use RGBto3D_train.json for training and validation. It contains 80k image_path, bounding box and 2D keypoints.
It has the same samples from the 2Dto3D_train.json, so you can also access to 2D keypoints if needed.
Use RGBto3D_test_img.json for test on leaderboard. It contains 20k image_path and bounding box.
Note that the test sample ids are not aligned with previous 2 tasks.

Evaluation

Validation

We do not provide a validation set. We encourage researchers to report 5-fold cross-validation results with average and standard deviation values.

Evaluation on test set

Please save your 3D whole-body predictions on test set into 'XXto3D_pred.json', and send a downloadable link for the json file to wholebody3d@gmail.com with subject Test set evaluation request. An example file to upload for the test set predictions can be found here.

Visualization

We provide a function to visualize 3D whole-body, as well as the evaluation function for the leaderboard in this script.

Benchmark

Please refer to benchmark.md for the benchmark results.

Terms of Use

This project is released under the MIT License.
We do not own the copyright of the images. Use of the images must abide by the Human3.6m License agreement.

Citation

If you find H3WB 3D WholeBody dataset useful for your project, please cite our paper as follows.

Yue Zhu, Nermin Samet, David Picard, "H3WB: Human3.6M 3D WholeBody Dataset and benchmark", arXiv, 2022.

BibTeX entry:

@article{h3wb,
  title={H3WB: Human3.6M 3D WholeBody Dataset and benchmark},
  author={Yue Zhu, Nermin Samet, David Picard},
  booktitle={arXiv},    
  year={2022}
}

Please also consider citing the following works.

@article{h36m_pami,
 author = {Ionescu, Catalin and Papava, Dragos and Olaru, Vlad and Sminchisescu, Cristian},
 title = {Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments},
 journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
 publisher = {IEEE Computer Society},
 year = {2014}
} 
 
@inproceedings{IonescuSminchisescu11,
 author = {Catalin Ionescu, Fuxin Li, Cristian Sminchisescu},
 title = {Latent Structured Models for Human Pose Estimation},
 booktitle = {International Conference on Computer Vision},
 year = {2011}
}

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
datasets		datasets
imgs		imgs
utils		utils
LICENSE.md		LICENSE.md
README.md		README.md
benchmark.md		benchmark.md
test_leaderboard.py		test_leaderboard.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

H3WB: Human3.6M 3D WholeBody Dataset and Benchmark

What is H3WB

H3WB Dataset

Download

Annotation format

Tasks

2D → 3D: 2D complete whole-body to 3D complete whole-body lifting

I2D → 3D: 2D incomplete whole-body to 3D complete whole-body lifting

RGB → 3D: Image to 3D complete whole-body prediction

Evaluation

Validation

Evaluation on test set

Visualization

Benchmark

Terms of Use

Citation

About

Releases

Packages

Languages

License

mornydew/wholebody3d

Folders and files

Latest commit

History

Repository files navigation

H3WB: Human3.6M 3D WholeBody Dataset and Benchmark

What is H3WB

H3WB Dataset

Download

Annotation format

Tasks

2D → 3D: 2D complete whole-body to 3D complete whole-body lifting

I2D → 3D: 2D incomplete whole-body to 3D complete whole-body lifting

RGB → 3D: Image to 3D complete whole-body prediction

Evaluation

Validation

Evaluation on test set

Visualization

Benchmark

Terms of Use

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages