RealNiceBoat/CV_Detectron2

Team NiceBoat Submission for DSTA TIL 2020 (machine vision + robotics)


Installation

pip install torch torchvision
pip install git+https://github.com/facebookresearch/detectron2.git
pip install git+https://github.com/jinmingteo/cocoapi.git#subdirectory=PythonAPI
pip install opencv-python
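
As a quick sanity check that everything installed correctly (this snippet is our suggestion, not part of the original setup):

import cv2
import pycocotools
import torch, torchvision
import detectron2

# If all of the imports above succeed, the stack is wired up
print(torch.__version__, torchvision.__version__, detectron2.__version__)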

Description

See this notebook for an overview of our training process. For the pure TIL model, here is a quick setup guide:

  1. Go here for the data and model. The password is: niceboat_til2020.
  2. Put the train & val data folders back into til2020. (I already fixed the annotation JSON.)
  3. Download the model (ft-til_resnet101_rcnn_moda_aug-147999-best_val.pth) and place it into final_ckpts.
  4. To add more augmentation components, see pipeline.py; a sketch of such a component follows this list.
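
For a rough idea of what an augmentation component involves, here is a minimal sketch using Detectron2's transform API. The actual components in pipeline.py may be structured differently, and the parameter values below are illustrative only:

import detectron2.data.transforms as T
from detectron2.config import get_cfg
from detectron2.data import DatasetMapper, build_detection_train_loader

# Augmentations applied on the fly to each training sample
augmentations = [
    T.RandomFlip(prob=0.5, horizontal=True),
    T.RandomBrightness(0.8, 1.2),
    T.ResizeShortestEdge(short_edge_length=(640, 800), max_size=1333, sample_style="range"),
]

# cfg must already point at a registered dataset via cfg.DATASETS.TRAIN
cfg = get_cfg()
mapper = DatasetMapper(cfg, is_train=True, augmentations=augmentations)
train_loader = build_detection_train_loader(cfg, mapper=mapper)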

Most of the important logic has been extracted into its own Python files, which can be found in the scripts folder alongside some utility scripts. A few more utility scripts for wrangling datasets are scattered elsewhere.

Citations

The library used is Facebook's Detectron2. The model is R101-FPN, i.e. ResNet-101 with a Feature Pyramid Network. As such, we cite:

@misc{wu2019detectron2,
  author =       {Yuxin Wu and Alexander Kirillov and Francisco Massa and
                  Wan-Yen Lo and Ross Girshick},
  title =        {Detectron2},
  howpublished = {\url{https://github.com/facebookresearch/detectron2}},
  year =         {2019}
}
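
For reference, an R101-FPN model can be pulled from the Detectron2 model zoo roughly as follows. This is a sketch, not our exact training setup: the config name below is the stock COCO object-detection one, and the image path is hypothetical.

import cv2
from detectron2 import model_zoo
from detectron2.config import get_cfg
from detectron2.engine import DefaultPredictor

cfg = get_cfg()
cfg.merge_from_file(model_zoo.get_config_file("COCO-Detection/faster_rcnn_R_101_FPN_3x.yaml"))
cfg.MODEL.ROI_HEADS.NUM_CLASSES = 5  # tops, trousers, outerwear, dresses, skirts
cfg.MODEL.WEIGHTS = "final_ckpts/ft-til_resnet101_rcnn_moda_aug-147999-best_val.pth"
predictor = DefaultPredictor(cfg)

outputs = predictor(cv2.imread("example.jpg"))  # hypothetical image path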

Ultimately, we found the TIL dataset insufficient. After researching and experimenting with multiple datasets, we stumbled across ModaNet, a set of extremely high-quality polygon annotations built on top of the Paper Doll dataset, which allowed our model to reach much higher AP scores.

@inproceedings{zheng/2018acmmm,
  author       = {Shuai Zheng and Fan Yang and M. Hadi Kiapour and Robinson Piramuthu},
  title        = {ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations},
  booktitle    = {ACM Multimedia},
  year         = {2018},
}
@inproceedings{yamaguchi/iccv2013,
  author       = {Kota Yamaguchi and M. Hadi Kiapour and Tamara L. Berg},
  title        = {Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing Items},
  booktitle    = {ICCV},
  year         = {2013}
}

The last dataset we used was DeepFashion2. It is a very large dataset, but it turned out to be too good to be true: the annotation quality is low, as evidenced by our tried-and-tested models being unable to train well on it.

@article{DeepFashion2,
  author = {Yuying Ge and Ruimao Zhang and Lingyun Wu and Xiaogang Wang and Xiaoou Tang and Ping Luo},
  title={A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images},
  journal={CVPR},
  year={2019}
}

R101-FPN (COCO keypoints pretrained) finetuned purely on ModaNet

TIL pycoco evaluation results:

IoU=0.20:0.50   IoU=0.20   IoU=0.30   IoU=0.40   IoU=0.50
0.761           0.770      0.764      0.758      0.751
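
The 0.20:0.50 IoU range differs from COCO's standard 0.50:0.95; with pycocotools (hence the forked cocoapi in the install step) such a sweep can be reproduced roughly like this, where both JSON paths are hypothetical:

import numpy as np
from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval

coco_gt = COCO("val_annotations.json")               # ground-truth annotations (hypothetical path)
coco_dt = coco_gt.loadRes("predictions.json")        # detections in COCO results format
coco_eval = COCOeval(coco_gt, coco_dt, iouType="bbox")
coco_eval.params.iouThrs = np.linspace(0.2, 0.5, 4)  # IoU = 0.20, 0.30, 0.40, 0.50
coco_eval.evaluate()
coco_eval.accumulate()
coco_eval.summarize()  # stock summarize() assumes the default thresholds; the fork presumably handles custom ones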

[06/18 19:20:02 d2.evaluation.coco_evaluation]: Evaluation results for bbox:

AP       AP50     AP75     APs      APm      APl
76.096   76.954   76.402   75.818   75.102   0.000

[06/18 19:20:02 d2.evaluation.coco_evaluation]: Per-category bbox AP:

category   AP       category   AP       category    AP
tops       65.887   trousers   70.078   outerwear   84.022
dresses    96.491   skirts     64.001

R101-FPN (COCO object detection pretrained) finetuned purely on the TIL dataset

TIL pycoco evaluation results:

IoU=0.20:0.50   IoU=0.20   IoU=0.30   IoU=0.40   IoU=0.50
0.687           0.701      0.694      0.686      0.665

[06/13 20:35:19 d2.evaluation.coco_evaluation]: Evaluation results for bbox:

AP       AP50     AP75     APs      APm      APl
68.744   70.114   69.380   68.594   66.490   0.000

[06/15 15:51:35 d2.evaluation.coco_evaluation]: Per-category bbox AP:

category   AP       category   AP       category    AP
tops       58.582   trousers   49.809   outerwear   78.678
dresses    96.898   skirts     59.754