Part-based Structured Representation Learning for Person Re-identification,ACM Transactions on Multimedia Computing, Communications, and Applications

当前位置： X-MOL 学术 › ACM Trans. Multimed. Comput. Commun. Appl. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Part-based Structured Representation Learning for Person Re-identification
ACM Transactions on Multimedia Computing, Communications, and Applications ( IF 5.1 ) Pub Date : 2020-12-17 , DOI: 10.1145/3412384
Yaoyu Li ₁ , Hantao Yao ₁ , Tianzhu Zhang ₂ , Changsheng Xu ₃

Affiliation

Person re-identification aims to match person of interest under non-overlapping camera views. Therefore, how to generate a robust and discriminative representation is crucial for person re-identification. Mining local clues from human body parts to describe pedestrians has been extensively studied in existing methods. However, existing methods locate human body parts coarsely and do not consider the relations among different local parts. To address the above problem, we propose a Part-based Structured Representation Learning (PSRL) for better exploiting local clues to improve the person representation. There are two important modules in our architecture: Local Semantic Feature Extraction and Structured Person Representation Learning. The Local Semantic Feature Extraction module is designed to extract local features from human body semantic regions. After obtaining the local features, the Structured Person Representation Learning is proposed to fuse the local features by considering the person structure. To model the underlying person structure, a graph convolutional network is employed to capture the relations of different semantic regions. The generated structured feature encodes underlying person structure information, and local semantic feature can solve the misalignment problem caused by pose variations in feature matching. By combining them together, we can improve the descriptive ability of the generated representation. Extensive evaluations on four standard benchmarks show that our proposed method achieves competitive performance against state-of-the-art methods.

中文翻译：

用于人员重新识别的基于部分的结构化表示学习

人员重新识别旨在在不重叠的摄像机视图下匹配感兴趣的人员。因此，如何生成鲁棒和有区别的表示对于人员重新识别至关重要。从人体部位挖掘局部线索来描述行人已在现有方法中得到广泛研究。然而，现有方法对人体部位的定位粗略，没有考虑不同局部部位之间的关系。为了解决上述问题，我们提出了一种基于部分的结构化表示学习（PSRL），以更好地利用局部线索来改善人物表示。我们的架构中有两个重要模块：局部语义特征提取和结构化人表示学习。局部语义特征提取模块旨在从人体语义区域中提取局部特征。在获得局部特征后，提出了结构化人表示学习，通过考虑人的结构来融合局部特征。为了对底层人物结构进行建模，使用图卷积网络来捕获不同语义区域的关系。生成的结构化特征对底层人物结构信息进行编码，局部语义特征可以解决特征匹配中姿态变化引起的错位问题。通过将它们组合在一起，我们可以提高生成表示的描述能力。对四个标准基准的广泛评估表明，我们提出的方法与最先进的方法相比具有竞争力。结构化人表示学习被提出通过考虑人的结构来融合局部特征。为了对底层人物结构进行建模，使用图卷积网络来捕获不同语义区域的关系。生成的结构化特征对底层人物结构信息进行编码，局部语义特征可以解决特征匹配中姿态变化引起的错位问题。通过将它们组合在一起，我们可以提高生成表示的描述能力。对四个标准基准的广泛评估表明，我们提出的方法与最先进的方法相比具有竞争力。结构化人表示学习被提出通过考虑人的结构来融合局部特征。为了对底层人物结构进行建模，使用图卷积网络来捕获不同语义区域的关系。生成的结构化特征对底层人物结构信息进行编码，局部语义特征可以解决特征匹配中姿态变化引起的错位问题。通过将它们组合在一起，我们可以提高生成表示的描述能力。对四个标准基准的广泛评估表明，我们提出的方法与最先进的方法相比具有竞争力。使用图卷积网络来捕获不同语义区域的关系。生成的结构化特征对底层人物结构信息进行编码，局部语义特征可以解决特征匹配中姿态变化引起的错位问题。通过将它们组合在一起，我们可以提高生成表示的描述能力。对四个标准基准的广泛评估表明，我们提出的方法与最先进的方法相比具有竞争力。使用图卷积网络来捕获不同语义区域的关系。生成的结构化特征对底层人物结构信息进行编码，局部语义特征可以解决特征匹配中姿态变化引起的错位问题。通过将它们组合在一起，我们可以提高生成表示的描述能力。对四个标准基准的广泛评估表明，我们提出的方法与最先进的方法相比具有竞争力。

更新日期：2020-12-17

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>