当前位置: X-MOL 学术ACM Trans. Intell. Syst. Technol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Video Object Segmentation and Tracking
ACM Transactions on Intelligent Systems and Technology ( IF 5 ) Pub Date : 2020-05-26 , DOI: 10.1145/3391743
Rui Yao 1 , Guosheng Lin 2 , Shixiong Xia 3 , Jiaqi Zhao 3 , Yong Zhou 3
Affiliation  

Object segmentation and object tracking are fundamental research areas in the computer vision community. These two topics are difficult to handle some common challenges, such as occlusion, deformation, motion blur, scale variation, and more. The former contains heterogeneous object, interacting object, edge ambiguity, and shape complexity; the latter suffers from difficulties in handling fast motion, out-of-view, and real-time processing. Combining the two problems of Video Object Segmentation and Tracking (VOST) can overcome their respective difficulties and improve their performance. VOST can be widely applied to many practical applications such as video summarization, high definition video compression, human computer interaction, and autonomous vehicles. This survey aims to provide a comprehensive review of the state-of-the-art VOST methods, classify these methods into different categories, and identify new trends. First, we broadly categorize VOST methods into Video Object Segmentation (VOS) and Segmentation-based Object Tracking (SOT). Each category is further classified into various types based on the segmentation and tracking mechanism. Moreover, we present some representative VOS and SOT methods of each time node. Second, we provide a detailed discussion and overview of the technical characteristics of the different methods. Third, we summarize the characteristics of the related video dataset and provide a variety of evaluation metrics. Finally, we point out a set of interesting future works and draw our own conclusions.

中文翻译:

视频对象分割和跟踪

对象分割和对象跟踪是计算机视觉社区的基础研究领域。这两个主题很难处理一些常见的挑战,例如遮挡、变形、运动模糊、尺度变化等等。前者包含异构对象、交互对象、边缘模糊性和形状复杂性;后者在处理快速运动、视野外和实时处理方面存在困难。结合视频对象分割和跟踪(VOST)这两个问题可以克服各自的困难,提高它们的性能。VOST可广泛应用于视频摘要、高清视频压缩、人机交互、自动驾驶汽车等诸多实际应用。本调查旨在全面回顾最先进的 VOST 方法,将这些方法分为不同的类别,并确定新的趋势。首先,我们将 VOST 方法大致分为视频对象分割 (VOS) 和基于分割的对象跟踪 (SOT)。每个类别根据分割和跟踪机制进一步分为各种类型。此外,我们还介绍了每个时间节点的一些具有代表性的 VOS 和 SOT 方法。其次,我们对不同方法的技术特征进行了详细讨论和概述。第三,我们总结了相关视频数据集的特点,并提供了多种评价指标。最后,我们指出了一组有趣的未来工作,并得出了我们自己的结论。我们将 VOST 方法大致分为视频对象分割 (VOS) 和基于分割的对象跟踪 (SOT)。每个类别根据分割和跟踪机制进一步分为各种类型。此外,我们还介绍了每个时间节点的一些具有代表性的 VOS 和 SOT 方法。其次,我们对不同方法的技术特征进行了详细讨论和概述。第三,我们总结了相关视频数据集的特点,并提供了多种评价指标。最后,我们指出了一组有趣的未来工作,并得出了我们自己的结论。我们将 VOST 方法大致分为视频对象分割 (VOS) 和基于分割的对象跟踪 (SOT)。每个类别根据分割和跟踪机制进一步分为各种类型。此外,我们还介绍了每个时间节点的一些具有代表性的 VOS 和 SOT 方法。其次,我们对不同方法的技术特征进行了详细讨论和概述。第三,我们总结了相关视频数据集的特点,并提供了多种评价指标。最后,我们指出了一组有趣的未来工作,并得出了我们自己的结论。我们对不同方法的技术特征进行了详细讨论和概述。第三,我们总结了相关视频数据集的特点,并提供了多种评价指标。最后,我们指出了一组有趣的未来工作,并得出了我们自己的结论。我们对不同方法的技术特征进行了详细讨论和概述。第三,我们总结了相关视频数据集的特点,并提供了多种评价指标。最后,我们指出了一组有趣的未来工作,并得出了我们自己的结论。
更新日期:2020-05-26
down
wechat
bug