TY - JOUR
T1 - Facial Landmark Detection with Tweaked Convolutional Neural Networks
AU - Wu, Yue
AU - Hassner, Tal
AU - Kim, Kanggeon
AU - Medioni, Gerard
AU - Natarajan, Prem
N1 - Publisher Copyright:
© 1979-2012 IEEE.
PY - 2018/12/1
Y1 - 2018/12/1
N2 - This paper concerns the problem of facial landmark detection. We provide a unique new analysis of the features produced at intermediate layers of a convolutional neural network (CNN) trained to regress facial landmark coordinates. This analysis shows that while being processed by the CNN, face images can be partitioned in an unsupervised manner into subsets containing faces in similar poses (i.e., 3D views) and facial properties (e.g., presence or absence of eye-wear). Based on this finding, we describe a novel CNN architecture, specialized to regress the facial landmark coordinates of faces in specific poses and appearances. To address the shortage of training data, particularly in extreme profile poses, we additionally present data augmentation techniques designed to provide sufficient training examples for each of these specialized sub-networks. The proposed Tweaked CNN (TCNN) architecture is shown to outperform existing landmark detection methods in an extensive battery of tests on the AFW, ALFW, and 300W benchmarks. Finally, to promote reproducibility of our results, we make code and trained models publicly available through our project webpage.
AB - This paper concerns the problem of facial landmark detection. We provide a unique new analysis of the features produced at intermediate layers of a convolutional neural network (CNN) trained to regress facial landmark coordinates. This analysis shows that while being processed by the CNN, face images can be partitioned in an unsupervised manner into subsets containing faces in similar poses (i.e., 3D views) and facial properties (e.g., presence or absence of eye-wear). Based on this finding, we describe a novel CNN architecture, specialized to regress the facial landmark coordinates of faces in specific poses and appearances. To address the shortage of training data, particularly in extreme profile poses, we additionally present data augmentation techniques designed to provide sufficient training examples for each of these specialized sub-networks. The proposed Tweaked CNN (TCNN) architecture is shown to outperform existing landmark detection methods in an extensive battery of tests on the AFW, ALFW, and 300W benchmarks. Finally, to promote reproducibility of our results, we make code and trained models publicly available through our project webpage.
KW - Face and gesture recognition
KW - Neural nets
UR - http://www.scopus.com/inward/record.url?scp=85039792429&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2017.2787130
DO - 10.1109/TPAMI.2017.2787130
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 29990138
AN - SCOPUS:85039792429
SN - 0162-8828
VL - 40
SP - 3067
EP - 3074
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 12
M1 - 8239860
ER -