6 July 2022 SiamGauss: Siamese region proposal network with Gaussian head for visual object tracking
Author Affiliations +
Abstract

We propose SiamGauss, a Siamese region proposal network with a Gaussian head for single-target visual object tracking for aerial benchmarks. Visual tracking in aerial videos faces unique challenges due to the large field of view resulting in small size objects, similar looking objects (confusers) in close proximity, occlusions, and fast motion due to simultaneous object and camera motion. In Siamese tracking, a cross-correlation ration is performed in the embedding space to obtain a similarity map of the target within a search frame, which is then used to localize the target. The proposed Gaussian head helps suppress the activation produced in the similarity map on confusers present in the search frame during training while boosting the confidence on the target. This activation suppression improves the confuser awareness of our tracker. In addition, improving the activation on the target helps maintain tracking consistency in fast motion. Our proposed Gaussian head is only applied during training and introduces no additional computational overhead during inference while tracking. Thus, SiamGauss achieves fast runtime performance. We evaluate our method on multiple aerial benchmarks showing that SiamGauss performs favorably with state-of-the-art trackers while rating at a frame rate of 96 frames per second.

© 2022 Society of Photo-Optical Instrumentation Engineers (SPIE) 1931-3195/2022/$28.00 © 2022 SPIE
Abu Md Niamul Taufique, Breton Minnehan, and Andreas E. Savakis "SiamGauss: Siamese region proposal network with Gaussian head for visual object tracking," Journal of Applied Remote Sensing 16(3), 036501 (6 July 2022). https://doi.org/10.1117/1.JRS.16.036501
Received: 2 March 2022; Accepted: 21 June 2022; Published: 6 July 2022
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical tracking

Head

Visualization

Video

Fermium

Frequency modulation

Cameras

Back to Top