Face Recognition Based on Videos by Using Convex Hulls,IEEE Transactions on Circuits and Systems for Video Technology

当前位置： X-MOL 学术 › IEEE Trans. Circ. Syst. Video Technol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Face Recognition Based on Videos by Using Convex Hulls
IEEE Transactions on Circuits and Systems for Video Technology ( IF 8.4 ) Pub Date : 2020-12-01 , DOI: 10.1109/tcsvt.2019.2926165
Hakan Cevikalp , Hasan Serhan Yavuz , Bill Triggs

A wide range of face appearance variations can be modeled by using set-based recognition approaches effectively, but computational complexity of current methods is highly dependent on the set and class sizes. This paper introduces new video-based classification methods designed for reducing the required disk space of data samples and speed up the testing process in large-scale face recognition systems. In the proposed method, image sets collected from videos are approximated with kernelized convex hulls and it was shown that it is sufficient to use only the samples that participate in shaping the image set boundaries in this setting. The kernelized support vector data description (SVDD) is used to extract those important samples that form the image set boundaries. Moreover, we show that these kernelized hypersphere models can also be used to approximate image sets for classification purposes. Then, we propose a binary hierarchical decision tree approach to improve the speed of the classification system even more. At last, we introduce a new video database that includes 285 people with 8 videos of each person, since the most popular video data sets used for set-based recognition methods include either a few people, or small number of videos per person. The experimental results on varying sized databases show that the proposed methods greatly improve the testing times of the classification system (we obtained speed-ups to a factor of 20) without a significant drop in accuracies.

中文翻译：

基于视频的凸包人脸识别

通过有效地使用基于集合的识别方法可以对广泛的面部外观变化进行建模，但当前方法的计算复杂性高度依赖于集合和类的大小。本文介绍了新的基于视频的分类方法，旨在减少数据样本所需的磁盘空间并加快大规模人脸识别系统中的测试过程。在所提出的方法中，从视频收集的图像集用核化凸包进行近似，结果表明，在此设置中仅使用参与塑造图像集边界的样本就足够了。核化支持向量数据描述 (SVDD) 用于提取构成图像集边界的那些重要样本。而且，我们展示了这些核化的超球面模型也可用于近似图像集以进行分类。然后，我们提出了一种二元分层决策树方法，以进一步提高分类系统的速度。最后，我们引入了一个新的视频数据库，该数据库包含 285 个人，每个人有 8 个视频，因为用于基于集合的识别方法的最流行的视频数据集包括几个人，或者每个人的少量视频。在不同大小的数据库上的实验结果表明，所提出的方法大大提高了分类系统的测试时间（我们获得了 20 倍的加速），而准确率没有显着下降。我们提出了一种二元分层决策树方法，以进一步提高分类系统的速度。最后，我们引入了一个新的视频数据库，该数据库包含 285 个人，每个人有 8 个视频，因为用于基于集合的识别方法的最流行的视频数据集包括几个人，或者每个人的少量视频。在不同大小的数据库上的实验结果表明，所提出的方法大大提高了分类系统的测试时间（我们获得了 20 倍的加速），而准确率没有显着下降。我们提出了一种二元分层决策树方法，以进一步提高分类系统的速度。最后，我们引入了一个新的视频数据库，该数据库包含 285 个人，每个人有 8 个视频，因为用于基于集合的识别方法的最流行的视频数据集包括几个人，或者每个人的少量视频。在不同大小的数据库上的实验结果表明，所提出的方法大大提高了分类系统的测试时间（我们获得了 20 倍的加速），而准确率没有显着下降。

更新日期：2020-12-01

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>