Abstract
We describe a deep-learning-based method for estimating 3D facial expression coefficients. Unlike previous work, our process does not rely on facial landmark detection methods as a proxy step. Recent methods have shown that a CNN can be trained to regress accurate and discriminative 3D morphable model (3DMM) representations directly from image intensities. By forgoing landmark detection, these methods were able to estimate shapes for occluded faces appearing in unprecedented viewing conditions. We build on those methods by showing that facial expressions can also be estimated by a robust, deep, landmark-free approach. Our ExpNet CNN is applied directly to the intensities of a face image and regresses a 29D vector of 3D expression coefficients. We propose a unique method for collecting data to train our network, leveraging the robustness of deep networks to training label noise. We further offer a novel means of evaluating the accuracy of estimated expression coefficients: by measuring how well they capture facial emotions on the CK+ and EmotiW-17 emotion recognition benchmarks. We show that our ExpNet produces expression coefficients which better discriminate between facial emotions than those obtained using state-of-the-art facial landmark detectors. Moreover, this advantage grows as image scales drop, demonstrating that our ExpNet is more robust to scale changes than landmark detectors. Finally, our ExpNet is orders of magnitude faster than its alternatives.
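To make the role of the 29D vector concrete, the following is a minimal sketch of how expression coefficients are typically consumed, assuming the common linear 3DMM formulation in which each coefficient weights one displacement vector added to a neutral face shape. The function name `apply_expression`, the toy vertex count, and the random placeholder basis are illustrative assumptions; the record above does not specify the basis or mesh used by ExpNet.

```python
import random

NUM_EXPR = 29  # number of expression coefficients stated in the abstract


def apply_expression(neutral, expr_basis, coeffs):
    """Deform a neutral shape with a linear expression basis (hypothetical helper).

    neutral:    flat list of 3*N vertex coordinates
    expr_basis: NUM_EXPR flat displacement vectors, each of length 3*N
    coeffs:     NUM_EXPR expression coefficients (e.g. a network's output)
    """
    if len(coeffs) != NUM_EXPR or len(expr_basis) != NUM_EXPR:
        raise ValueError("expected 29 coefficients and 29 basis vectors")
    shape = list(neutral)
    for weight, component in zip(coeffs, expr_basis):
        if len(component) != len(neutral):
            raise ValueError("basis vector length must match the shape length")
        # Accumulate the weighted displacement for this expression component.
        for i, delta in enumerate(component):
            shape[i] += weight * delta
    return shape


# Toy data (assumption): 4 vertices, i.e. 12 coordinates, and a random basis.
rng = random.Random(0)
num_coords = 12
neutral = [0.0] * num_coords
expr_basis = [
    [rng.uniform(-1.0, 1.0) for _ in range(num_coords)]
    for _ in range(NUM_EXPR)
]

# A one-hot coefficient vector activates a single expression component.
one_hot = [0.0] * NUM_EXPR
one_hot[3] = 1.0
deformed = apply_expression(neutral, expr_basis, one_hot)
```

With all coefficients at zero the neutral shape is returned unchanged, which is why such a vector can be read as a compact description of the expression alone.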
Original language | English |
---|---|
Title of host publication | Proceedings - 13th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2018 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 122-129 |
Number of pages | 8 |
ISBN (Electronic) | 9781538623350 |
State | Published - 5 Jun 2018 |
Event | 13th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2018 - Xi'an, China. Duration: 15 May 2018 → 19 May 2018 |
Publication series
Name | Proceedings - 13th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2018 |
---|
Conference
Conference | 13th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2018 |
---|---|
Country/Territory | China |
City | Xi'an |
Period | 15/05/18 → 19/05/18 |
Bibliographical note
Publisher Copyright: © 2018 IEEE.
Keywords
- 3D expression modeling
- Deep neural networks