当前位置: X-MOL 学术J. Circuits Syst. Comput. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A Description Scheme for Video Overview Based on Scene Detection and Face Clustering
Journal of Circuits, Systems and Computers ( IF 1.5 ) Pub Date : 2020-08-09 , DOI: 10.1142/s021812662150002x
Boyuan Tang 1 , Weiting Chen 1
Affiliation  

With the rapid growth of online videos, it is crucial to generate overviews of videos to help audiences make viewing decisions and save time. Video summarization and video captioning are two of the most common solutions. In this paper, we proposed a new solution in the form of a series of scene-person pairs generated from our proposed video description scheme. This new formation takes substantially less time than watching video summaries and is more acceptable than video captions. In addition, our method can be generalized to different types of videos. We also proposed a face clustering method and a scene detection method. The experimental results indicate that our methods outperform other state-of-the-art methods and are highly generalizable. As an example, a demo application is developed to demonstrate the proposed description scheme.

中文翻译:

一种基于场景检测和人脸聚类的视频概览描述方案

随着在线视频的快速增长,生成视频概览以帮助观众做出观看决定并节省时间至关重要。视频摘要和视频字幕是两种最常见的解决方案。在本文中,我们提出了一种新的解决方案,其形式为从我们提出的视频描述方案中生成的一系列场景人对。这种新形式比观看视频摘要花费的时间要少得多,并且比视频字幕更容易接受。此外,我们的方法可以推广到不同类型的视频。我们还提出了一种人脸聚类方法和一种场景检测方法。实验结果表明,我们的方法优于其他最先进的方法并且具有高度的泛化性。例如,开发了一个演示应用程序来演示所提出的描述方案。
更新日期:2020-08-09
down
wechat
bug