Temporal Weighting Appearance-Aligned Network for Nighttime Video Retrieval,IEEE Signal Processing Letters

当前位置： X-MOL 学术 › IEEE Signal Process. Lett. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Temporal Weighting Appearance-Aligned Network for Nighttime Video Retrieval
IEEE Signal Processing Letters ( IF 3.9 ) Pub Date : 2022-09-19 , DOI: 10.1109/lsp.2022.3207620
Weijian Ruan ₁ , Yiran Tao ₂ , Linjun Ruan ₃ , Xiujun Shu ₄ , Yu Qiao ₁

Affiliation

Video-based person re-identification (ReID) aims at re-identifying video sequences of a specified person from videos captured by disjoint cameras. Existing datasets and works on this task all focus on daytime scenarios and cannot adapt well to the nighttime scenarios, which is also of significant importance for practical applications. In this letter, we contribute a new dataset for nighttime video-based ReID, termed NIVIR, which contains 800 identities with over 228,000 images. NIVIR contains video shots under various lighting conditions, different weathers, and complex scenarios, which is consistent with the real nighttime outdoor surveillance. Furthermore, we propose a temporal weighting appearance-aligned network (TWAN) for nighttime video-based ReID, which is composed of a correlation-based appearance-aligned module (CAM) and a temporal weighting module (TWM). Specifically, CAM is proposed to reconstruct the adjacent feature maps to guarantee the appearance alignment between the central frame and its adjacent frames. TWM is designed to evaluate the frame quality of a tracklet and generate temporal weights to enhance the video representation. Extensive experiments conducted on our new NIVIR dataset demonstrate that the proposed TWAN outperforms the state-of-the-art methods. We believe that our NIVIR dataset and the comprehensive attempts for solving the nighttime ReID problem will push forward the development of the ReID research community.

中文翻译：

用于夜间视频检索的时间加权外观对齐网络

基于视频的人员重新识别（ReID）旨在从不相交的摄像机捕获的视频中重新识别指定人员的视频序列。现有的数据集和该任务的工作都集中在白天场景，不能很好地适应夜间场景，这对于实际应用也很重要。在这封信中，我们为基于夜间视频的 ReID 提供了一个新的数据集，称为 NIVIR，其中包含 800 个身份和超过 228,000 张图像。NIVIR包含各种光照条件、不同天气、复杂场景下的视频镜头，与真实的夜间户外监控相一致。此外，我们提出了一种用于夜间基于视频的 ReID 的时间加权外观对齐网络（TWAN），它由基于相关性的外观对齐模块（CAM）和时间加权模块（TWM）组成。具体来说，提出了 CAM 来重建相邻特征图，以保证中心帧与其相邻帧之间的外观对齐。TWM 旨在评估 tracklet 的帧质量并生成时间权重以增强视频表示。在我们新的 NIVIR 数据集上进行的大量实验表明，所提出的 TWAN 优于最先进的方法。我们相信我们的 NIVIR 数据集和解决夜间 ReID 问题的综合尝试将推动 ReID 研究社区的发展。CAM 被提出来重建相邻的特征图，以保证中心帧与其相邻帧之间的外观对齐。TWM 旨在评估 tracklet 的帧质量并生成时间权重以增强视频表示。在我们新的 NIVIR 数据集上进行的大量实验表明，所提出的 TWAN 优于最先进的方法。我们相信我们的 NIVIR 数据集和解决夜间 ReID 问题的综合尝试将推动 ReID 研究社区的发展。CAM 被提出来重建相邻的特征图，以保证中心帧与其相邻帧之间的外观对齐。TWM 旨在评估 tracklet 的帧质量并生成时间权重以增强视频表示。在我们新的 NIVIR 数据集上进行的大量实验表明，所提出的 TWAN 优于最先进的方法。我们相信我们的 NIVIR 数据集和解决夜间 ReID 问题的综合尝试将推动 ReID 研究社区的发展。

更新日期：2022-09-19

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>