Elsevier

Displays

Volume 70, December 2021, 102098
Displays

Environmental Perception with Spatial Regularization Correlation Filter for Visual Tracking

https://doi.org/10.1016/j.displa.2021.102098Get rights and content

Highlights

  • We propose a new framework for visual target tracking that perceives changes in the environment and adaptively adjusts the learning rate to avoid template pollution. Specifically, the APEC indicator and the response error between two frames are used to jointly perceive changes in the environment and adaptively adjust the learning rate.

  • We uses realistic background information to train filters, which not only suppresses the interference of background information, but also enhances the discrimination ability of the filter.

  • We solve the energy function of the filter as a non-closed solution. To reduce computational complexity, we use ADMM algorithm to transform the energy function into two subproblems with globally optimal solutions, because both subproblems are convex. Therefore, the two sub-problems have their own closed solution and globally optimal solutions.

  • Experimental results on three benchmarks, i.e., OTB-2015, TC-128 and VOT-2018 validate that the proposed approach obviously improves the tracking accuracy of CF based trackers.

Abstract

With the introduction of correlation filtering (CF), the performance of visual object tracking is significantly improved. Circular shifts collecting samples is a key component of the CF tracker, and it also causes negative boundary effects. Most trackers add spatial regularization to alleviate boundary effects well. However, these trackers ignore the effect of environmental changes on tracking performance, and the filter discriminates poorly in the background interference. Here, to break these limitations, we propose a new correlation filter model, namely Environmental Perception with Spatial Regularization Correlation Filter for Visual Tracking. Specifically, we use the Average Peak to Correlation Energy (APCE) and the response value error between the two frames together to perceive environmental changes, which adjusts the learning rate to make the template more adaptable to environmental changes. To enhance the discriminatory capability of the filter, we use real background information as negative samples to train the filter model. In addition, the introduction of the regular term destroys the closed solution of CF, and this problem can be effectively solved by the use of the alternating direction method of multipliers (ADMM). Extensive experimental evaluations on three large tracking benchmarks are performed, which demonstrate the good performance of the proposed method over some of the state-of-the-art trackers.

Introduction

Visual object tracking is a popular research direction in the field of computer vision. Its purpose is to be able to accurately determine the location of a target in an image sequence without the need to identify what the target is. It has been widely used in areas such as intelligent visual surveillance [1], [2], unmanned vehicles [3], [4] , human–computer interaction [5] and biology [6], etc. Although in the past decades, visual object tracking algorithms have developed rapidly and in a wide variety of ways, there are still many challenges. For example, the tracking process is often disturbed by scale variation deformation, fast movements or target occlusion. At the same time, achieving high-speed and high-precision target localization has always been a difficult task.

In CF trackers, the use of circular shifts and Fast Fourier Transform (FFT) for effective training and detection can greatly improve computational efficiency, but it also causes boundary effects. This problem leads to degraded tracking performance, especially in the case of fast moving targets. Recently, there are two schemes which can effectively reduce the influence of boundary effects. The first scheme is the use of cropping operations to collect real negative samples in order to improve samples quality, such as CFLB [7], BACF [8], etc. The second scheme adds spatial regularization to constrain the weight of the filter coefficients, such as SRDCF [9], CSR-DCF [10], etc. However, SRDCF uses circular shifts to obtain negative samples instead of using real negative samples, so it is prone to drift when the target is similar to the background information. SRDCF also failed to consider the effect of environmental changes on the template, which can lead to pollution of the template when the environment changes (e.g., occlusion, background clutters). In addition, SRDCF uses Gauss–Seidel solver filters, which are computationally expensive.

In this paper, for these problems mentioned above, we propose a novel visual object tracking algorithm, namely Environmental Perception with Spatial Regularization Correlation Filter for Visual Tracking. The main contributions of this paper are summarized as follows:

We propose a new framework for visual target tracking that perceives changes in the environment and adaptively adjusts the learning rate to avoid template pollution. Specifically, the APEC indicator [11] and the response error between two frames are used to jointly perceive changes in the environment and adaptively adjust the learning rate.

We uses realistic background information to train filters, which not only suppresses the interference of background information, but also enhances the discrimination ability of the filter.

We solve the energy function of the filter as a non-closed solution. To reduce computational complexity, we use ADMM algorithm [12] to transform the energy function into two sub-problems with globally optimal solutions, because both sub-problems are convex. Therefore, the two sub-problems have their own closed solution and globally optimal solutions.

Experimental results on three benchmarks, i.e., OTB-2015 [13], TC-128 [14] and VOT-2018 [15] validate that the proposed approach obviously improves the tracking accuracy of CF based trackers.

Section snippets

Correlation filter for tracking

Correlation filter (CF) was first applied in signal processing. It was not until 2010 that Bolme et al. [16] proposed the Minimum Output Sum of Squared Error (MOSSE) filter, which applies correlation filtering to visual tracking techniques and uses grayscale feature to establish the target appearance model. It uses the fast Fourier transform to cleverly transform the relevant calculations from the spatial domain to the frequency domain, which greatly reducing the computational effort and

Review BACF

CF trackers use cyclic shifts to collect training samples. This approach relies on the assumption of sample period expansion, which allows model training and target localization to be efficiently accomplished with the fast Fourier transform. However, it also brings negative boundary effects. The main reason is that negative samples generated by the circular shift are non-realistic, which is difficult to express the real image content, thus reducing the discriminative ability of filters. To

Implementation details

The experimental hardware environment is a computer with a CPU Intel Core i5-7500 and a single NVIDIA GTX 1030 GPU, a main frequency of 3.40 GHz, and an 8 GB memory configuration. The algorithm development platform is MATLAB R2018a. We used HOG features and depth features (Norm1 from VGG-M, Conv4-3 from VGG-16) to build the target appearance model. For scale estimation, we chose n=5. The initialized learning rate is η0=0.0185, α=3×103, and regularization parameter is λ=1.2. thresholds Φ1=0.5

Conclusion

In this work, we propose Environmental Perception with Spatial Regularization Correlation Filter for Visual Tracking. We use real background information as training samples and use the APEC indicator and the error value between two frames together to perceive changes in the environment and adaptively adjust the learning rate. In addition, our tracker is optimized using the ADMM algorithm to reduce the complexity of the computation. Extensive experimental results show that compared with many

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The authors thank all the reviewers and associate editors for their valuable comments. This work was supported by the National Natural Science Foundation of China [grant number U1813220], the Natural Science Foundation of Xinjiang Uygur Autonomous Region [grant number 2019D01C02] and the Fundamental Research Funds for the Central Universities [buctrc202105].

References (33)

  • ManafifardM. et al.

    A survey on player tracking in soccer videos

    Comput. Vis. Image Underst.

    (2017)
  • RanT. et al.

    Scene perception based visual navigation of mobile robot in indoor environment

    ISA Trans.

    (2021)
  • BougetD. et al.

    Vision-based and marker-less surgical tool detection and tracking: a review of the literature

    Med. Image Anal.

    (2017)
  • GaoZ. et al.

    Extended geometric models for stereoscopic 3D with vertical screen disparity

    Displays

    (2020)
  • GaoM. et al.

    Manifold siamese network: A novel visual tracking ConvNet for autonomous vehicles

    IEEE Trans. Intell. Transp. Syst.

    (2020)
  • YuanL. et al.

    Object tracking with particle filtering in fluorescence microscopy images: Application to the motion of neurofilaments in axons

    IEEE Trans. Med. Imaging

    (2012)
  • H.K. Galoogahi, T. Sim, S. Lucey, Correlation filters with limited boundaries, in: 2015 IEEE Conference on Computer...
  • H.K. Galoogahi, A. Fagg, S. Lucey, Learning background-aware correlation filters for visual tracking, in: 2017 IEEE...
  • M. Danelljan, G. Häger, F. Khan, M. Felsberg, Learning spatially regularized correlation filters for visual tracking,...
  • A. Lukežic, T. Vojír, L.C. Zajc, J. Matas, M. Kristan, Discriminative correlation filter with channel and spatial...
  • M. Wang, Y. Liu, Z. Huang, Large margin object tracking with circulant feature maps, in: 2017 IEEE Conference on...
  • BoydS.P. et al.

    Distributed optimization and statistical learning via the alternating direction method of multipliers

    Found. Trends Mach. Learn.

    (2011)
  • WuY. et al.

    Object tracking benchmark

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2015)
  • LiangP. et al.

    Encoding color information for visual tracking: Algorithms and benchmark

    IEEE Trans. Image Process.

    (2015)
  • M. Kristan, A. Leonardis, J. Matas, M. Felsberg, R. Pflugfelder, L. Čehovin Zajc, T. Vojir, G. Bhat, A. Lukezic, A....
  • D.S. Bolme, J.R. Beveridge, B.A. Draper, Y.M. Lui, Visual object tracking using adaptive correlation filters, in: 2010...
  • Cited by (0)

    This paper was recommended for publication by G. Guangtao Zhai.

    View full text