Visual tracking in video sequences based on biologically inspired mechanisms,Computer Vision and Image Understanding

当前位置： X-MOL 学术 › Comput. Vis. Image Underst. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Visual tracking in video sequences based on biologically inspired mechanisms
Computer Vision and Image Understanding ( IF 4.3 ) Pub Date : 2018-10-26 , DOI: 10.1016/j.cviu.2018.10.002
Alireza Sokhandan , Amirhassan Monadjemi

Visual tracking is the process of locating one or more objects based on their appearance. The high variation in the conditions and states of a moving object and presence of challenges such as background clutter, illumination variation, occlusion, etc. makes this problem extremely complex, and hard to achieve a robust algorithm in this field. However, unlike the machine vision, in the biological vision, the task of visual tracking is ideally conducted even in the worst conditions. Consequently, in this paper, taking into account the superior performance of biological vision in visual tracking, a biologically inspired visual tracking algorithm is introduced. The proposed algorithm inspiring the task-driven recognition procedure of the primary layers of the ventral pathway, and visual cortex mechanisms including spatial–temporal processing, motion perception, attention, and saliency to track a single object in the video sequence. For this purpose, a set of low-level features including the oriented-edges, color, and motion information (inspired by the layer V1) extracted from the target area and based on the discrimination rate that each feature creates with the background (inspired by the saliency mechanism), a subset of these features are employed to generate the appearance model and identify the target location. Moreover, by memorizing the shape and motion information (inspired by the short-term memory) scale variation and occlusion are handled. The experimental results showed that the proposed algorithm can well handle most of the visual tracking challenges, achieve high precision in target locating and act in a real-time manner.

中文翻译：

基于生物启发机制的视频序列中的视觉跟踪

视觉跟踪是根据一个或多个对象的外观来定位它们的过程。运动物体的状态和状态的高变化以及诸如背景杂波，照明变化，遮挡等挑战的存在，使该问题变得极其复杂，并且在该领域中难以实现鲁棒的算法。但是，与机器视觉不同，在生物视觉中，即使在最恶劣的条件下，视觉跟踪也可以理想地完成。因此，在本文中，考虑到生物视觉在视觉跟踪中的优越性能，提出了一种生物启发的视觉跟踪算法。提出的算法启发了腹侧通路主要层的任务驱动识别程序，以及视觉皮层机制，包括时空处理，运动感知，注意力和显着性来跟踪视频序列中的单个对象。为此，从目标区域提取的低层特征集包括定向边缘，颜色和运动信息（由层V1启发），并基于每个特征与背景创建的鉴别率（由层启发）。显着性机制），则使用这些功能的子集来生成外观模型并识别目标位置。此外，通过记忆形状和运动信息（受短期记忆启发），可以处理比例变化和遮挡。实验结果表明，该算法能够很好地应对大多数视觉跟踪挑战，实现了目标定位的高精度和实时性。并突出显示视频序列中的单个对象。为此，从目标区域提取的低层特征集包括定向边缘，颜色和运动信息（由层V1启发），并基于每个特征与背景创建的鉴别率（由层启发）。显着性机制），则使用这些特征的子集来生成外观模型并识别目标位置。此外，通过记忆形状和运动信息（受短期记忆启发），可以处理比例变化和遮挡。实验结果表明，该算法能够很好地应对大多数视觉跟踪挑战，实现目标定位的高精度和实时性。并突出显示视频序列中的单个对象。为此，从目标区域提取的低层特征集包括定向边缘，颜色和运动信息（由层V1启发），并基于每个特征与背景创建的鉴别率（由层启发）。显着性机制），则使用这些特征的子集来生成外观模型并识别目标位置。此外，通过记忆形状和运动信息（受短期记忆启发），可以处理比例变化和遮挡。实验结果表明，该算法能够很好地应对大多数视觉跟踪挑战，实现目标定位的高精度和实时性。一组低级特征，包括从目标区域提取的定向边缘，颜色和运动信息（由层V1启发），并基于每个特征与背景创建的辨别率（受显着性机制启发）这些特征的子集用于生成外观模型并标识目标位置。此外，通过记忆形状和运动信息（受短期记忆启发），可以处理比例变化和遮挡。实验结果表明，该算法能够很好地应对大多数视觉跟踪挑战，实现目标定位的高精度和实时性。一组低级特征，包括从目标区域提取的定向边缘，颜色和运动信息（由层V1启发），并基于每个特征与背景创建的辨别率（受显着性机制启发）这些特征的子集用于生成外观模型并标识目标位置。此外，通过记忆形状和运动信息（受短期记忆启发），可以处理比例变化和遮挡。实验结果表明，该算法能够很好地应对大多数视觉跟踪挑战，实现目标定位的高精度和实时性。以及从目标区域提取的运动信息（由第V1层启发），并基于每个特征与背景（由显着性机制启发）创建的辨别率，使用这些特征的子集生成外观模型并进行识别目标位置。此外，通过记忆形状和运动信息（受短期记忆启发），可以处理比例变化和遮挡。实验结果表明，该算法能够很好地应对大多数视觉跟踪挑战，实现了目标定位的高精度和实时性。以及从目标区域提取的运动信息（由第V1层启发），并基于每个特征与背景（由显着性机制启发）创建的辨别率，使用这些特征的子集生成外观模型并进行识别目标位置。此外，通过记忆形状和运动信息（受短期记忆启发），可以处理比例变化和遮挡。实验结果表明，该算法能够很好地应对大多数视觉跟踪挑战，实现了目标定位的高精度和实时性。通过记忆形状和运动信息（受短期记忆启发），可以处理尺度变化和遮挡。实验结果表明，该算法能够很好地应对大多数视觉跟踪挑战，实现目标定位的高精度和实时性。通过记忆形状和运动信息（受短期记忆启发），可以处理尺度变化和遮挡。实验结果表明，该算法能够很好地应对大多数视觉跟踪挑战，实现目标定位的高精度和实时性。

更新日期：2020-04-20

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11