Estimation of BMI from facial images using semantic segmentation based region-aware pooling,Computers in Biology and Medicine

当前位置： X-MOL 学术 › Comput. Biol. Med. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Estimation of BMI from facial images using semantic segmentation based region-aware pooling
Computers in Biology and Medicine ( IF 7.7 ) Pub Date : 2021-04-15 , DOI: 10.1016/j.compbiomed.2021.104392
Nadeem Yousaf ₁ , Sarfaraz Hussein ₂ , Waqas Sultani ₁

Affiliation

Body-Mass-Index (BMI) conveys important information about one's life such as health and socio-economic conditions. Large-scale automatic estimation of BMIs can help predict several societal behaviors such as health, job opportunities, friendships, and popularity. The recent works have either employed hand-crafted geometrical face features or face-level deep convolutional neural network features for face to BMI prediction. The hand-crafted geometrical face feature lack generalizability and face-level deep features don't have detailed local information. Although useful, these methods missed the detailed local information which is essential for exact BMI prediction. In this paper, we propose to use deep features that are pooled from different face regions (eye, nose, eyebrow, lips, etc.) and demonstrate that this explicit pooling from face regions can significantly boost the performance of BMI prediction. To address the problem of accurate and pixel-level face regions localization, we propose to use face semantic segmentation in our framework. Extensive experiments are performed using different Convolutional Neural Network (CNN) backbones including FaceNet and VGG-face on three publicly available datasets: VisualBMI, Bollywood and VIP attributes. Experimental results demonstrate that, as compared to the recent works, the proposed Reg-GAP gives a percentage improvement of 22.4% on VIP-attribute, 3.3% on VisualBMI, and 63.09% on the Bollywood dataset.

中文翻译：

使用基于语义分割的区域感知池从面部图像估计BMI

身体质量指数（BMI）传达有关生命的重要信息，例如健康和社会经济状况。BMI的大规模自动估计可以帮助预测一些社会行为，例如健康，工作机会，友谊和受欢迎程度。最近的工作要么采用手工制作的几何面部特征，要么采用面部级深度卷积神经网络特征来预测BMI。手工制作的几何脸部特征缺乏可概括性，而脸部级别的深层特征没有详细的本地信息。尽管有用，但这些方法错过了详细的本地信息，这对于准确的BMI预测至关重要。在本文中，我们建议使用从不同脸部区域（眼睛，鼻子，眉毛，嘴唇等）汇集的深层特征。），并证明从面部区域进行的这种显式合并可以显着提高BMI预测的性能。为了解决准确的像素级面部区域定位问题，我们建议在我们的框架中使用面部语义分割。使用不同的卷积神经网络（CNN）骨干（包括FaceNet和VGG-face）在三个可公开获取的数据集上进行了广泛的实验：VisualBMI，宝莱坞和VIP属性。实验结果表明，与最近的工作相比，提出的Reg-GAP在VIP属性上的百分比提高了22.4％，在VisualBMI上提高了3.3％，在宝莱坞数据集上提高了63.09％。我们建议在我们的框架中使用人脸语义分割。使用不同的卷积神经网络（CNN）骨干（包括FaceNet和VGG-face）在三个可公开获取的数据集上进行了广泛的实验：VisualBMI，宝莱坞和VIP属性。实验结果表明，与最近的工作相比，提出的Reg-GAP在VIP属性上的百分比提高了22.4％，在VisualBMI上提高了3.3％，在宝莱坞数据集上提高了63.09％。我们建议在我们的框架中使用人脸语义分割。使用不同的卷积神经网络（CNN）骨干（包括FaceNet和VGG-face）在三个可公开获取的数据集上进行了广泛的实验：VisualBMI，宝莱坞和VIP属性。实验结果表明，与最近的工作相比，提出的Reg-GAP在VIP属性上的百分比提高了22.4％，在VisualBMI上提高了3.3％，在宝莱坞数据集上提高了63.09％。

更新日期：2021-04-22

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>