End-to-end face parsing via interlinked convolutional neural networks.

Cogn Neurodyn

School of Technology, Beijing Forestry University, Beijing, 100083 China.

Published: February 2021

Face parsing is an important computer vision task that requires accurate pixel segmentation of facial parts (such as eyes, nose, mouth, etc.), providing a basis for further face analysis, modification, and other applications. Interlinked Convolutional Neural Networks (iCNN) was proved to be an effective two-stage model for face parsing. However, the original iCNN was trained separately in two stages, limiting its performance. To solve this problem, we introduce a simple, end-to-end face parsing framework: STN-aided iCNN(STN-iCNN), which extends the iCNN by adding a Spatial Transformer Network (STN) between the two isolated stages. The STN-iCNN uses the STN to provide a trainable connection to the original two-stage iCNN pipeline, making end-to-end joint training possible. Moreover, as a by-product, STN also provides more precise cropped parts than the original cropper. Due to these two advantages, our approach significantly improves the accuracy of the original model. Our model achieved competitive performance on the Helen Dataset, the standard face parsing dataset. It also achieved superior performance on CelebAMask-HQ dataset, proving its good generalization. Our code has been released at https://github.com/aod321/STN-iCNN.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7947053PMC
http://dx.doi.org/10.1007/s11571-020-09615-4DOI Listing

Publication Analysis

Top Keywords

face parsing
20
end-to-end face
8
interlinked convolutional
8
convolutional neural
8
neural networks
8
parsing
5
face
5
parsing interlinked
4
networks face
4
parsing computer
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!