2D+3D Facial Expression Recognition via Discriminative Dynamic Range Enhancement and Multi-Scale Learning [article]

Yang Jiao, Yi Niu, Trac D. Tran, Guangming Shi
2020 arXiv   pre-print
In 2D+3D facial expression recognition (FER), existing methods generate multi-view geometry maps to enhance the depth feature representation. However, this may introduce false estimations due to local plane fitting from incomplete point clouds. In this paper, we propose a novel Map Generation technique from the viewpoint of information theory, to boost the slight 3D expression differences from strong personality variations. First, we examine the HDR depth data to extract the discriminative
more » ... ic range r_dis, and maximize the entropy of r_dis to a global optimum. Then, to prevent the large deformation caused by over-enhancement, we introduce a depth distortion constraint and reduce the complexity from O(KN^2) to O(KNτ). Furthermore, the constrained optimization is modeled as a K-edges maximum weight path problem in a directed acyclic graph, and we solve it efficiently via dynamic programming. Finally, we also design an efficient Facial Attention structure to automatically locate subtle discriminative facial parts for multi-scale learning, and train it with a proposed loss function ℒ_FA without any facial landmarks. Experimental results on different datasets show that the proposed method is effective and outperforms the state-of-the-art 2D+3D FER methods in both FER accuracy and the output entropy of the generated maps.
arXiv:2011.08333v1 fatcat:wapprzdrobhlho3j3ygsckqmsq