Correlation Discrepancy Insight Network for Video Re-identification,ACM Transactions on Multimedia Computing, Communications, and Applications

当前位置： X-MOL 学术 › ACM Trans. Multimed. Comput. Commun. Appl. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Correlation Discrepancy Insight Network for Video Re-identification
ACM Transactions on Multimedia Computing, Communications, and Applications ( IF 5.1 ) Pub Date : 2020-12-17 , DOI: 10.1145/3402666
Weijian Ruan ₁ , Chao Liang ₁ , Yi Yu ₂ , Zheng Wang ₂ , Wu Liu ₃ , Jun Chen ₁ , Jiayi Ma ₄

Affiliation

Video-based person re-identification (ReID) aims at re-identifying a specified person sequence from videos that were captured by disjoint cameras. Most existing works on this task ignore the quality discrepancy across frames by using all video frames to develop a ReID method. Additionally, they adopt only the person self-characteristic as the representation, which cannot adapt to cross-camera variation effectively. To that end, we propose a novel correlation discrepancy insight network for video-based person ReID, which consists of an unsupervised correlation insight model (CIM) for video purification and a discrepancy description network (DDN) for person representation. Concretely, CIM is constructed by using kernelized correlation filters to encode person half-parts, which evaluates the frame quality by the cross correlation across frames for selecting discriminative video fragments. Furthermore, DDN exploits the selected video fragments to generate a discrepancy descriptor using a compression network, which aims at employing the discrepancies with other persons’ to facilitate the representation of the target person rather than only using the self-characteristic. Due to the advantage in handling cross-domain variation, the discrepancy descriptor is expected to provide a new pattern for the object representation in cross-camera tasks. Experimental results on three public benchmarks demonstrate that the proposed method outperforms several state-of-the-art methods.

中文翻译：

用于视频重新识别的相关差异洞察网络

基于视频的人员重新识别 (ReID) 旨在从不相交的摄像机捕获的视频中重新识别指定的人员序列。大多数现有的关于此任务的工作通过使用所有视频帧来开发 ReID 方法，忽略了帧之间的质量差异。此外，它们仅采用人的自我特征作为表示，不能有效地适应跨相机的变化。为此，我们提出了一种新的基于视频的人 ReID 的相关差异洞察网络，它由一个用于视频净化的无监督相关洞察模型 (CIM) 和一个用于人表示的差异描述网络 (DDN) 组成。具体来说，CIM 是通过使用核相关滤波器对人的半部分进行编码来构建的，它通过跨帧的互相关来评估帧质量，以选择有区别的视频片段。此外，DDN 利用选定的视频片段使用压缩网络生成差异描述符，其目的是利用与其他人的差异来促进目标人的表示，而不仅仅是使用自我特征。由于处理跨域变化的优势，差异描述符有望为跨相机任务中的对象表示提供新模式。三个公共基准的实验结果表明，所提出的方法优于几种最先进的方法。DDN利用所选视频片段使用压缩网络生成差异描述符，其目的是利用与其他人的差异来促进目标人的表示，而不仅仅是使用自我特征。由于处理跨域变化的优势，差异描述符有望为跨相机任务中的对象表示提供新模式。三个公共基准的实验结果表明，所提出的方法优于几种最先进的方法。DDN利用所选视频片段使用压缩网络生成差异描述符，其目的是利用与其他人的差异来促进目标人的表示，而不仅仅是使用自我特征。由于处理跨域变化的优势，差异描述符有望为跨相机任务中的对象表示提供新模式。三个公共基准的实验结果表明，所提出的方法优于几种最先进的方法。差异描述符有望为跨相机任务中的对象表示提供一种新模式。三个公共基准的实验结果表明，所提出的方法优于几种最先进的方法。差异描述符有望为跨相机任务中的对象表示提供一种新模式。三个公共基准的实验结果表明，所提出的方法优于几种最先进的方法。

更新日期：2020-12-17

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>