Abstract
In the video surveillance of medical institutions, pain intensity is a significant clue to the state of patients. Of late, some approaches leverage various spatio-temporal methods to capture the dynamic pain information of videos for accomplishing pain estimation automatically. However, there is still a challenge in the spatio-temporal saliency, which means pain is always reflected in some important regions of informative image frames in a video sequence. To this end, we propose a deep spatio-temporal attention model called as Pain-Attentive Network (PAN), which pays more attention on the saliency in the extraction of dynamic features. PAN consists of two subnetworks: spatial and temporal subnetwork. Especially, in spatial subnetwork, a proposed spatial attention module is embedded to make the spatial feature extraction more targeted. Also, a devised temporal attention module is inserted in temporal subnetwork, so that the temporal features focus on informative image frames. Extensive experiment results on the UNBC-McMaster Shoulder Pain database show that our proposed PAN achieves compelling performances. In addition, to evaluate the generalization, we report competitive results of our proposed method in the Remote Collaborative and Affective database.
Similar content being viewed by others
References
Albanie S, Vedaldi A (2016) Learning grimaces by watching tv. arXiv preprint arXiv:1610.02255
Ashraf AB, Lucey S, Cohn JF, Chen T, Ambadar Z, Prkachin KM, Solomon PE (2009) The painful face–pain expression recognition using active appearance models. Image Vision Comput 27(12):1788–1796
Bahdanau D, Cho K, Bengio Y (2014) Neural machine translation by jointly learning to align and translate. arXiv:1409.0473
Barros P, Parisi GI, Weber C, Wermter S (2017) Emotion-modulated attention improves expression recognition: a deep learning model. Neurocomputing 253:104–114
Bartlett MS, Littlewort GC, Frank MG, Lee K (2014) Automatic decoding of facial movements reveals deceptive pain expressions. Curr Biol 24(7):738–743
Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv:1412.3555
Cootes TF, Edwards GJ, Taylor CJ (1998) Active appearance models. In: European conference on computer vision. Springer, New York, pp 484–498
Dong Y, Zhang Z, Hong WC (2018) A hybrid seasonal mechanism with a chaotic cuckoo search algorithm with a support vector regression model for electric load forecasting. Energies 11(4):1009
Florea C, Florea L, Vertan C (2014) Learning pain from emotion: transferred hot data representation for pain intensity estimation. In: European conference on computer vision. Springer, New York, pp 778–790
Hammal Z, Cohn JF (2012) Automatic detection of pain intensity. In: Proceedings of the 14th ACM international conference on multimodal interaction, ACM, pp 47–52
Hammal Z, Kunz M (2012) Pain monitoring: a dynamic and context-sensitive system. Pattern Recogn 45(4):1265–1280
Han J, Zhang Z, Cummins N, Ringeval F, Schuller B (2017) Strength modelling for real-worldautomatic continuous affect recognition from audiovisual signals. Image Vis Comput 65:76–86
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Hong WC, Dong Y, Lai CY, Chen LY, Wei SY (2011) Svr with hybrid chaotic immune algorithm for seasonal load demand forecasting. Energies 4(6):960–977
Hong WC, Li MW, Geng J, Zhang Y (2019) Novel chaotic bat algorithm for forecasting complex motion of floating platforms. Appl Math Model 72:425–443
Hong X, Zhao G, Zafeiriou S, Pantic M, Pietikäinen M (2016) Capturing correlations of local features for image representation. Neurocomputing 184:99–106
Huang D, Xia Z, Li L, Wang K, Feng X (2019) Pain-awareness multistream convolutional neural network for pain estimation. J Electron Imaging 28(4):043,008
Irani R, Nasrollahi K, Moeslund TB (2015) Pain recognition using spatiotemporal oriented energy of facial muscles. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 80–87
Jaderberg M, Simonyan K, Zisserman A, et al. (2015) Spatial transformer networks. In: Advances in neural information processing systems, pp 2017–2025
Kaltwang S, Rudovic O, Pantic M (2012) Continuous pain intensity estimation from facial expressions. In: International symposium on visual computing. Springer, New York, pp 368–377
Kaya H, Gürpınar F, Salah AA (2017) Video-based emotion recognition in the wild using deep transfer learning and score fusion. Image Vision Comput 65:66–75
Kollias D, Tzirakis P, Nicolaou MA, Papaioannou A, Zhao G, Schuller B, Kotsia I, Zafeiriou S (2019) Deep affect prediction in-the-wild: Aff-wild database and challenge, deep architectures, and beyond. Int J Comput Vis 127 (6-7):907–929
Kundra H, Sadawarti H (2015) Hybrid algorithm of cuckoo search and particle swarm optimization for natural terrain feature extraction. Res J Inf Technol 7(1):58–69
Li L, Xia Z, Hadid A, Jiang X, Zhang H, Feng X (2019) Replayed video attack detection based on motion blur analysis. IEEE Trans Inform Forensics Secur 14(9):2246–2261
Li Y, Zeng J, Shan S, Chen X (2019) Occlusion aware facial expression recognition using cnn with attention mechanism. IEEE Trans Image Process 28(5):2439–2450
Littlewort GC, Bartlett MS, Lee K (2009) Automatic coding of facial expressions displayed during posed and genuine pain. Image Vis Comput 27(12):1797–1803
Liu D, Peng F, Shea A, Picard R (2017) Deepfacelift: interpretable personalized models for automatic estimation of self-reported pain. J Mach Learn Res 66:1–16
Liu M, Li S, Shan S, Wang R, Chen X (2014) Deeply learning deformable facial action parts model for dynamic expression analysis. In: Asian conference on computer vision. Springer, New York, pp 143–157
Lucey P, Cohn JF, Prkachin KM, Solomon PE, Chew S, Matthews I (2012) Painful monitoring: automatic pain monitoring using the unbc-mcmaster shoulder pain expression archive database. Image Vis Comput 30(3):197–205
Lucey P, Cohn JF, Prkachin KM, Solomon PE, Matthews I (2011) Painful data: the unbc-mcmaster shoulder pain expression archive database. In: Face and gesture 2011, IEEE, pp 57–64
Martinez L, Rosalind Picard D, et al. (2017) Personalized automatic estimation of self-reported pain intensity from facial expressions. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 70–79
Minaee S, Abdolrashidi A (2019) Deep-emotion:, Facial expression recognition using attentional convolutional network. arXiv:1902.01019
Neshov N, Manolova A (2015) Pain detection from facial characteristics using supervised descent method. In: 2015 IEEE 8Th international conference on intelligent data acquisition and advanced computing systems: technology and applications (IDAACS), IEEE, vol 1, pp 251–256
Pei W, Dibeklioġlu H, Baltrušaitis T, Tax DM (2017) Attended end-to-end architecture for age estimation from facial expression videos. arXiv:1711.08690
Prkachin KM (1992) The consistency of facial expressions of pain: a comparison across modalities. Pain 51(3):297–306
Rathee N, Ganotra D (2017) A novel approach for continuous pain intensity estimation. In: Proceeding of international conference on intelligent communication, control and devices. Springer, New York, pp 443–450
Ringeval F, Sonderegger A, Sauer J, Lalanne D (2013) Introducing the recola multimodal corpus of remote collaborative and affective interactions. In: 2013 10Th IEEE international conference and workshops on automatic face and gesture recognition (FG), IEEE, pp 1–8
Rodriguez P, Cucurull G, Gonzàlez J, Gonfaus JM, Nasrollahi K, Moeslund TB, Roca FX (2017) Deep pain: exploiting long short-term memory networks for facial expression classification. IEEE Trans Cybern
Rudovic O, Pavlovic V, Pantic M (2015) Context-sensitive dynamic ordinal regression for intensity estimation of facial action units. IEEE Trans Patt Anal Mach Intell 37(5):944–958
Ruiz A, Rudovic O, Binefa X, Pantic M (2018) Multi-instance dynamic ordinal random fields for weakly supervised facial behavior analysis. IEEE Trans Image Process 27(8):3969–3982
Sikka K, Dhall A, Bartlett MS (2014) Classification and weakly supervised pain localization using multiple segment representation. Image Vision Comput 32(10):659–670
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556
Sun W, Zhao H, Jin Z (2018) A visual attention based roi detection method for facial expression recognition. Neurocomputing 296:12–22
Tavakolian M, Hadid A (2018) Deep binary representation of facial expressions: a novel framework for automatic pain intensity recognition. In: 2018 25Th IEEE international conference on image processing (ICIP), IEEE, pp 1952–1956
Tavakolian M, Hadid A (2018) Deep spatiotemporal representation of the face for automatic pain intensity estimation. In: 2018 24Th international conference on pattern recognition (ICPR), IEEE, pp 350–354
Tzirakis P, Trigeorgis G, Nicolaou MA, Schuller BW, Zafeiriou S (2017) End-to-end multimodal emotion recognition using deep neural networks. IEEE J Select Topics Signal Process 11(8):1301–1309
Wang F, Xiang X, Liu C, Tran TD, Reiter A, Hager GD, Quon H, Cheng J, Yuille AL (2017) Regularizing face verification nets for pain intensity regression. In: 2017 IEEE international conference on image processing (ICIP), IEEE, pp 1087–1091
Wang J, Sun H (2018) Pain intensity estimation using deep spatiotemporal and handcrafted features. IEICE Trans Inf Syst 101(6):1572–1580
Werner P, Al-Hamadi A, Limbrecht-Ecklundt K, Walter S, Gruss S, Traue HC (2016) Automatic pain assessment with facial activity descriptors. IEEE Trans Affect Comput 8(3):286–299
Werner P, Al-Hamadi A, Niese R (2012) Pain recognition and intensity rating based on comparative learning. In: 2012 19Th IEEE international conference on image processing, IEEE, pp 2313–2316
Werner P, Al-Hamadi A, Niese R, Walter S, Gruss S, Traue HC (2014) Automatic pain recognition from video and biomedical signals. In: 2014 22Nd international conference on pattern recognition, IEEE, pp 4582–4587
Xia Z, Hong X, Gao X, Feng X, Zhao G (2020) Spatiotemporal recurrent convolutional networks for recognizing spontaneous micro-expressions. IEEE Trans Multimed 22(3):626–640
Xu K, Ba J, Kiros R, Cho K, Courville A, Salakhudinov R, Zemel R, Bengio Y (2015) Show, attend and tell: neural image caption generation with visual attention. In: International conference on machine learning, pp 2048–2057
Yang R, Hong X, Peng J, Feng X, Zhao G (2018) Incorporating high-level and low-level cues for pain intensity estimation. In: 2018 24Th international conference on pattern recognition (ICPR), IEEE, pp 3495–3500
Yang Z, He X, Gao J, Deng L, Smola A (2016) Stacked attention networks for image question answering. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 21–29
Zhang Y, Zhao R, Dong W, Hu BG, Ji Q (2018) Bilateral ordinal relevance multi-instance regression for facial action unit intensity estimation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7034–7043
Zhang Z, Hong W, Li J (2020) Electric load forecasting by hybrid self-recurrent support vector regression model with variational mode decomposition and improved cuckoo search algorithm. IEEE Access 8:14,642–14,658
Zhang Z, Hong WC (2019) Electric load forecasting by complete ensemble empirical mode decomposition adaptive noise and support vector regression with quantum-based dragonfly algorithm. Nonlinear Dyn 98(2):1107–1136
Zhao R, Gan Q, Wang S, Ji Q (2016) Facial expression intensity estimation using ordinal information. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3466–3474
Zhou J, Hong X, Su F, Zhao G (2016) Recurrent convolutional neural network regression for continuous pain intensity estimation in video. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 84–92
Zwakhalen SM, Hamers JP, Abu-Saad HH, Berger MP (2006) Pain in elderly people with severe dementia: a systematic review of behavioural pain assessment tools. BMC Geriatrics 6(1):3
Acknowledgements
This work is partly supported by the National Natural Science Foundation of China (No. 61702419), and the Natural Science Basic Research Plan in Shaanxi Province of China (No. 2018JQ6090).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interests
There is no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Huang, D., Xia, Z., Mwesigye, J. et al. Pain-attentive network: a deep spatio-temporal attention model for pain estimation. Multimed Tools Appl 79, 28329–28354 (2020). https://doi.org/10.1007/s11042-020-09397-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-09397-1