Name		Name	Last commit message	Last commit date
parent directory ..
configs		configs
demo		demo
deploy		deploy
docs		docs
ppmatting		ppmatting
tests		tests
tools		tools
README.md		README.md
README_CN.md		README_CN.md
requirements.txt		requirements.txt

README.md

English | 简体中文

Image Matting

Introduction

Image Matting is the technique of extracting foreground from an image by calculating its color and transparency. It is widely used in the film industry to replace background, image composition, and visual effects. Each pixel in the image will have a value that represents its foreground transparency, called Alpha. The set of all Alpha values in an image is called Alpha Matte. The part of the image covered by the mask can be extracted to complete foreground separation.

Update Notes

2022.11
- Release self developed lite matting SOTA model PP-MattingV2. Compared with MODNet, the inference speed of PP-MattingV2 is increased by 44.6%, and the average error is decreased by 17.91%.
- Adjust the document structure and improve the model zoo information.
- FastDeploy support PP-MattingV2, PP-Matting, PP-HumanMatting and MODNet models.
2022.07
- Release PP-Matting code. Add ClosedFormMatting, KNNMatting, FastMatting, LearningBaseMatting and RandomWalksMatting traditional machine learning algorithms. Add GCA model.
- upport to specify metrics for evaluation. Support to specify metrics for evaluation.
2022.04
- Release self developed high accuracy matting SOTA model PP-Matting. Add PP-HumanMatting high-resolution human matting model.
- Add Grad, Conn evaluation metrics. Add foreground evaluation funciton, which use ML algorithm to evaluate foreground when prediction or background replacement.
- Add GradientLoss and LaplacianLoss. Add RandomSharpen, RandomSharpen, RandomReJpeg, RSSN data augmentation strategies.
2021.11
- Matting Project is released, which Realizes image matting function.
- Support Matting models: DIM, MODNet. Support model export and python deployment. Support background replacement function. Support human matting deployment in Android.

Community

If you have any questions, suggestions and feature requests, please create an issues in GitHub Issues.
Welcome to scan the following QR code and join paddleseg wechat group to communicate with us.

Models

For the widely application scenario -- human matting, we have trained and open source the ** high-quality human matting models**. According the actual application scenario, you can directly deploy or finetune.

The model zoo includes our self developded high accuracy model PP-Matting and lite model PP-MattingV2.

PP-Matting is a high accuracy matting model developded by PaddleSeg, which realizes high-resolution image matting under semantic guidance by the design of Guidance Flow. For high accuracy, this model is recommended. Two pre-trained models are opened source with 512 and 1024 resolution level.
PP-MattingV2 is a lite matting SOTA model developed by PaddleSeg. It extracts high-level semantc informating by double-pyramid pool and spatial attention, and uses multi-level feature fusion mechanism for both semantic and detail prediciton.

Model	SAD	MSE	Grad	Conn	Params(M)	FLOPs(G)	FPS	Config File	Checkpoint	Inference Model
PP-MattingV2-512	40.59	0.0038	33.86	38.90	8.95	7.51	98.89	cfg	model	model inference
PP-Matting-512	31.56	0.0022	31.80	30.13	24.5	91.28	28.9	cfg	model	model inference
PP-Matting-1024	66.22	0.0088	32.90	64.80	24.5	91.28	13.4(1024X1024)	cfg	model	model inference
PP-HumanMatting	53.15	0.0054	43.75	52.03	63.9	135.8 (2048X2048)	32.8(2048X2048)	cfg	model	model inference
MODNet-MobileNetV2	50.07	0.0053	35.55	48.37	6.5	15.7	68.4	cfg	model	model inference
MODNet-ResNet50_vd	39.01	0.0038	32.29	37.38	92.2	151.6	29.0	cfg	model	model inference
MODNet-HRNet_W18	35.55	0.0035	31.73	34.07	10.2	28.5	62.6	cfg	model	model inference
DIM-VGG16	32.31	0.0233	28.89	31.45	28.4	175.5	30.4	cfg	model	model inference

Note：

The dataset for metrics is composed of PPM-100 and human part of AIM-500, with a total of 195 images, which named PPM-AIM-195.
The model default input size is (512, 512) while calculating FLOPs and FPS and the GPU is Tesla V100 32G. FPS is calculated base on Paddle Inference.
DIM is a trimap-base matting method, which metrics are calculated in transition area. If no trimap image is provided, the area 0<alpha<255 is used as the transition area after dilation erosion with a radius of 25 pixels.

Tutorials

Online experience
Quick start
Full development
Human matting android deployment
Human matting .NET deployment
Dataset preparation
AI Studio tutorials
- The Matting tutorial of PaddleSeg
- The image matting tutorial of PP-Matting

Acknowledgement

Thanks Qian bin for their contributons.
Thanks for the algorithm support of GFM.

Citation

@article{chen2022pp,
  title={PP-Matting: High-Accuracy Natural Image Matting},
  author={Chen, Guowei and Liu, Yi and Wang, Jian and Peng, Juncai and Hao, Yuying and Chu, Lutao and Tang, Shiyu and Wu, Zewu and Chen, Zeyu and Yu, Zhiliang and others},
  journal={arXiv preprint arXiv:2204.09433},
  year={2022}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Matting

Matting

configs

configs

demo

demo

deploy

deploy

docs

docs

ppmatting

ppmatting

tests

tests

tools

tools

README.md

README.md

README_CN.md

README_CN.md

requirements.txt

requirements.txt

README.md

Image Matting

Contents

Introduction

Update Notes

Community

Models

Tutorials

Acknowledgement

Citation

Files

Matting

Directory actions

More options

Directory actions

More options

Latest commit

History

Matting

Folders and files

parent directory

Image Matting

Contents

Introduction

Update Notes

Community

Models

Tutorials

Acknowledgement

Citation