当前位置: X-MOL 学术IET Comput. Vis. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Harnessing feedback region proposals for multi-object tracking
IET Computer Vision ( IF 1.5 ) Pub Date : 2020-11-16 , DOI: 10.1049/iet-cvi.2019.0943
Aswathy Prasanna Kumar 1 , Deepak Mishra 1
Affiliation  

In the tracking-by-detection approach of online multiple object tracking (MOT), a major challenge is how to associate object detections on the new video frame with previously tracked objects. Two important aspects that directly influence the performance of MOT are quality of detection and accuracy in data association. The authors propose an efficient and unified MOT framework for improved object detection, followed by enhanced object tracking. The object detection and tracking are considered as two independent functions in the tracking-by-detection paradigm. In this study, object detection accuracy has been increased by employing a faster region-based convolutional neural network (Faster R-CNN) modified with the feedback region proposals from the tracker. Target association is performed by the correlation filter-based Siamese CNN model, which finds the similarity score between the input image patches. The Siamese CNN is trained using a supervised hard sample mining strategy. An optical flow-based motion model is employed to predict the next probable location of the targets from the tracker and these region proposals are fed back to the classifier module of Faster R-CNN. The authors’ extensive analysis of publicly available MOT benchmark datasets and comparison with the state-of-the-art tracking methods demonstrate competitive tracking performance of the proposed MOT framework.

中文翻译:

利用反馈区域建议进行多对象跟踪

在在线多对象跟踪(MOT)的按检测跟踪方法中,主要的挑战是如何将新视频帧上的对象检测与以前跟踪的对象相关联。直接影响MOT性能的两个重要方面是检测质量和数据关联的准确性。作者提出了一种有效且统一的MOT框架,用于改进的对象检测,然后是增强的对象跟踪。在“逐个检测跟踪”范例中,对象检测和跟踪被视为两个独立的功能。在这项研究中,通过使用基于跟踪器的反馈区域建议修改的更快的基于区域的卷积神经网络(Faster R-CNN),提高了对象检测的准确性。目标关联是通过基于相关过滤器的Siamese CNN模型执行的,查找输入图像块之间的相似性得分。使用有监督的硬样本挖掘策略对暹罗CNN进行了培训。使用基于光流的运动模型来预测来自跟踪器的目标的下一个可能位置,并将这些区域建议反馈到Faster R-CNN的分类器模块。作者对公开可用的MOT基准数据集进行了广泛的分析,并与最新的跟踪方法进行了比较,证明了所提出的MOT框架具有竞争性的跟踪性能。使用基于光流的运动模型来预测来自跟踪器的目标的下一个可能位置,并将这些区域建议反馈到Faster R-CNN的分类器模块。作者对公开可用的MOT基准数据集进行了广泛的分析,并与最新的跟踪方法进行了比较,证明了所提出的MOT框架具有竞争性的跟踪性能。使用基于光流的运动模型来预测来自跟踪器的目标的下一个可能位置,并将这些区域建议反馈到Faster R-CNN的分类器模块。作者对公开可用的MOT基准数据集进行了广泛的分析,并与最新的跟踪方法进行了比较,证明了所提出的MOT框架具有竞争性的跟踪性能。
更新日期:2020-11-17
down
wechat
bug