Easy Face Detection

Proof of concept for including post and pre-processing steps inside onnx model.

In simple words you pass [width, height, 3] uint8 tensor to the model and you get back key points, scores, and bounding boxes.

Example inference in python can be found here, and models files here

Naming convention

Model name starts with the name of the original model, if size_adjust is included in the name model will accept dynamically sized tensor and adjust the detection results to the size of the original image. The last number is the size on which the model performs inference.

Upsides

Ease of you in any language and runtime of your choice
Reduction of duplicate code for decoding detection results

Downsides

Batch inference
- Keras can't produce inputs without batch dimension. So you need to modify the graph directly if you achieve that.
- Exported models have hardcoded batch dimensions to 1
- Impossibility of concatination ragged tensors with more than one dimension
- Proof of concept code can be found here
Dynamic input size without resizing
- SCRFD models can accept the image that size is dividable by 32, but I couldn't make it work.
Image resizing could be not as efficient as a native implementation
The outputs names produced by tf2onnx are dependent on the number of layers that TensorFlow created, so code won't work if you run in the wrong order or you try to generate two models in the same runtime.
Tools for modifying and manipulating onnx graphs have rough corners and are not really mature

Motivation and afterthoughts

The original motivation was the need to re-write decoding steps while using another detection model trained on the coco dataset in nodejs using onnxruntime.

Also, I needed to preprocess the image - normalize it, add batch dimension, convert it to float32.

I am not arguing that including everything inside your model is the best or even good choice but changing your processing pipeline from NumPy and cv2 to your deep learning framework of choice would allow the conversion to other formats, and potentially save some redundant code when you need your model in another language and shouldn't be a lot of trouble when you have original code with original weights and are not mixing frameworks as shown in this repo.

License

The code in this repository is released under the MIT License.

But base weights for models come from InsightFace python package, and they are available for non-commercial research purposes only,

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
models		models
LICENSE		LICENSE
README.md		README.md
batch_example.ipynb		batch_example.ipynb
convert.ipynb		convert.ipynb
example.jpg		example.jpg
inference_example.ipynb		inference_example.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

models

models

LICENSE

LICENSE

README.md

README.md

batch_example.ipynb

batch_example.ipynb

convert.ipynb

convert.ipynb

example.jpg

example.jpg

inference_example.ipynb

inference_example.ipynb

Repository files navigation

Easy Face Detection

Naming convention

Upsides

Downsides

Motivation and afterthoughts

License

About

Releases

Packages

Languages

License

magicaltoast/easy-facedetection

Folders and files

Latest commit

History

Repository files navigation

Easy Face Detection

Naming convention

Upsides

Downsides

Motivation and afterthoughts

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages