SGS2Net: deep representation of facial expression by graph-preserving sparse coding

Ruicong Zhi; Ming Wan; Xin Hu

doi:10.1117/1.JEI.29.6.063015

Journal of Electronic Imaging, Vol. 29, Issue 6, 063015 (December 2020). https://doi.org/10.1117/1.JEI.29.6.063015

Abstract

Deep learning has developed rapidly in recent years and has greatly improved facial expression recognition. However, deep networks behave as black boxes, which makes their results hard to interpret. Differentiable programming offers a way to balance the interpretability and convenience of deep learning: it solves the sparse coding problem, which has solid mathematical foundations, end-to-end with a recurrent neural network. However, sparse representation is traditionally an unsupervised learning method and does not effectively exploit supervised information, which is helpful for facial expression recognition. We propose a differentiable programming algorithm, called the supervised graph-preserving sparse two neural network (SGS2Net), that exploits both sparse and graph-preserving properties. Because the graph-preserving constraint can encode class information, the new model can be trained in either an unsupervised or a supervised manner. A sparse representation of the facial images is obtained by minimizing the l₁-norm of the coefficients, and the neighborhood of the samples is preserved by retaining the graph structure. The optimization is carried out by gradient descent and threshold shrinkage and is implemented end-to-end by a new deep network structure. SGS2Net is applied to facial micro-expression recognition and facial expression-based pain assessment, where it improves recognition accuracy by more than 10% and 4%, respectively, compared with state-of-the-art methods. These results indicate that the graph-preserving constraint greatly improves the discriminative power of the network, and that SGS2Net draws solid interpretability from sparse representation and graph embedding theory.
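The abstract names the ingredients of the optimization (an l₁ penalty, a graph-preserving term, gradient descent, and threshold shrinkage) but does not give the objective itself. The following is a minimal NumPy sketch of how such a scheme could look, not the authors' implementation. It assumes a generic objective 0.5·||X − DZ||² + lam·||Z||₁ + 0.5·gamma·tr(Z L Zᵀ), where L is a k-nearest-neighbor graph Laplacian over the samples. The function names (`knn_laplacian`, `graph_ista`), the weights `lam` and `gamma`, and the binary k-NN graph are all illustrative assumptions. Passing class labels restricts graph edges to same-class samples, which is one plausible way the graph constraint could carry supervised information.

```python
import numpy as np

def soft_threshold(v, tau):
    # Threshold shrinkage: the proximal operator of tau * ||v||_1.
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def knn_laplacian(X, k=3, labels=None):
    # X holds one sample per column, shape (d, n).
    # Builds an unweighted, symmetrized k-NN graph (an assumption of this sketch).
    n = X.shape[1]
    sq = np.sum(X ** 2, axis=0)
    dist = sq[:, None] + sq[None, :] - 2.0 * X.T @ X
    np.fill_diagonal(dist, np.inf)  # no self-loops
    if labels is not None:
        # Supervised variant: forbid edges between different classes.
        labels = np.asarray(labels)
        dist[labels[:, None] != labels[None, :]] = np.inf
    W = np.zeros((n, n))
    nearest = np.argsort(dist, axis=1)[:, :k]
    for i in range(n):
        for j in nearest[i]:
            if np.isfinite(dist[i, j]):
                W[i, j] = W[j, i] = 1.0
    return np.diag(W.sum(axis=1)) - W  # graph Laplacian L = Deg - W

def objective(X, D, Z, L, lam, gamma):
    # Assumed objective: reconstruction + sparsity + graph smoothness.
    recon = 0.5 * np.linalg.norm(X - D @ Z) ** 2
    sparse = lam * np.abs(Z).sum()
    graph = 0.5 * gamma * np.trace(Z @ L @ Z.T)
    return recon + sparse + graph

def graph_ista(X, D, L, lam=0.1, gamma=0.1, n_iter=100):
    # Step size from a Lipschitz bound on the gradient of the smooth part.
    step = 1.0 / (np.linalg.norm(D, 2) ** 2 + gamma * np.linalg.norm(L, 2))
    Z = np.zeros((D.shape[1], X.shape[1]))
    for _ in range(n_iter):
        # Gradient step on the reconstruction and graph terms ...
        grad = D.T @ (D @ Z - X) + gamma * Z @ L
        # ... followed by shrinkage for the l1 term.
        Z = soft_threshold(Z - step * grad, step * lam)
    return Z

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.standard_normal((8, 12))    # 12 toy samples of dimension 8
    D = rng.standard_normal((8, 20))    # random dictionary with 20 atoms
    L = knn_laplacian(X, k=3, labels=[0] * 6 + [1] * 6)
    Z = graph_ista(X, D, L)
    print("nonzero coefficients:", int(np.count_nonzero(Z)), "of", Z.size)
```

Each pass of the loop is a fixed linear map followed by a shrinkage nonlinearity, which is why iterations of this kind can be unrolled into the layers of a recurrent network and trained end-to-end. In this sketch the dictionary and thresholds stay fixed; a learned variant would treat them as trainable parameters.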

Citation

Ruicong Zhi, Ming Wan, and Xin Hu "SGS2Net: deep representation of facial expression by graph-preserving sparse coding," Journal of Electronic Imaging 29(6), 063015 (26 December 2020). https://doi.org/10.1117/1.JEI.29.6.063015

Received: 15 June 2020; Accepted: 25 November 2020; Published: 26 December 2020

KEYWORDS

Facial recognition systems

Databases

Associative arrays

Neural networks

Computer programming

Machine learning

Solids