FASHE: A FrActal Based Strategy for Head Pose Estimation,IEEE Transactions on Image Processing

当前位置： X-MOL 学术 › IEEE Trans. Image Process. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

FASHE: A FrActal Based Strategy for Head Pose Estimation
IEEE Transactions on Image Processing ( IF 10.6 ) Pub Date : 2021-02-22 , DOI: 10.1109/tip.2021.3059409
Carmen Bisogni , Michele Nappi , Chiara Pero , Stefano Ricciardi

Head pose estimation (HPE) represents a topic central to many relevant research fields and characterized by a wide application range. In particular, HPE performed using a singular RGB frame is particular suitable to be applied at best-frame-selection problems. This explains a growing interest witnessed by a large number of contributions, most of which exploit deep learning architectures and require extensive training sessions to achieve accuracy and robustness in estimating head rotations on three axes. However, methods alternative to machine learning approaches could be capable of similar if not better performance. To this regard, we present FASHE, an approach based on partitioned iterated function systems (PIFS) to represent auto-similarities within face image through a contractive affine function transforming the domain blocks extracted only once by a single frontal reference image, in a good approximation of the range blocks which the target image has been partitioned into. Pose estimation is achieved by finding the closest match between fractal code of target image and a reference array by means of Hamming distance. The results of experiments conducted exceed the state of the art on both Biwi and Ponting’04 datasets as well as approaching those of the best performing methods on the challenging AFLW2000 database. In addition, the applications to GOTCHA Video Dataset demonstrate that FASHE successfully operates in-the-wild.

中文翻译：

FASHE：一种基于分形的头部姿势估计策略

头部姿势估计（HPE）是许多相关研究领域的核心主题，具有广泛的应用范围。特别是，使用单一 RGB 帧执行的 HPE 特别适合应用于最佳帧选择问题。这解释了人们对大量贡献日益增长的兴趣，其中大部分贡献利用深度学习架构，需要大量的培训课程才能在估计三个轴上的头部旋转时实现准确性和鲁棒性。然而，替代机器学习方法的方法即使不是更好的性能，也可能具有相似的性能。对此，我们提出 FASHE，一种基于分区迭代函数系统 (PIFS) 的方法，通过收缩仿射函数转换由单个正面参考图像仅提取一次的域块，以很好地逼近目标图像的范围块，从而表示人脸图像中的自相似性已经划分成。姿态估计是通过汉明距离找到目标图像的分形码与参考阵列之间的最接近匹配来实现的。所进行的实验结果超过了 Biwi 和 Ponting'04 数据集的最新技术水平，并接近具有挑战性的 AFLW2000 数据库上表现最佳的方法。此外，GOTCHA 视频数据集的应用表明 FASHE 成功地在野外运行。在目标图像被分割成的范围块的良好近似中。姿态估计是通过汉明距离找到目标图像的分形码与参考阵列之间的最接近匹配来实现的。所进行的实验结果超过了 Biwi 和 Ponting'04 数据集的最新技术水平，并接近具有挑战性的 AFLW2000 数据库上表现最佳的方法。此外，GOTCHA 视频数据集的应用表明 FASHE 成功地在野外运行。在目标图像被分割成的范围块的良好近似中。姿态估计是通过汉明距离找到目标图像的分形码与参考阵列之间的最接近匹配来实现的。所进行的实验结果超过了 Biwi 和 Ponting'04 数据集的最新技术水平，并接近具有挑战性的 AFLW2000 数据库上表现最佳的方法。此外，GOTCHA 视频数据集的应用表明 FASHE 成功地在野外运行。所进行的实验结果超过了 Biwi 和 Ponting'04 数据集的最新技术水平，并接近具有挑战性的 AFLW2000 数据库上表现最佳的方法。此外，GOTCHA 视频数据集的应用表明 FASHE 成功地在野外运行。所进行的实验结果超过了 Biwi 和 Ponting'04 数据集的最新技术水平，并接近具有挑战性的 AFLW2000 数据库上表现最佳的方法。此外，GOTCHA 视频数据集的应用表明 FASHE 成功地在野外运行。

更新日期：2021-02-26

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>