Task-aware Attention Model for Clothing Attribute Prediction,IEEE Transactions on Circuits and Systems for Video Technology

当前位置： X-MOL 学术 › IEEE Trans. Circ. Syst. Video Technol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Task-aware Attention Model for Clothing Attribute Prediction
IEEE Transactions on Circuits and Systems for Video Technology ( IF 8.4 ) Pub Date : 2020-04-01 , DOI: 10.1109/tcsvt.2019.2902268
Sanyi Zhang , Zhanjie Song , Xiaochun Cao , Hua Zhang , Jie Zhou

Clothing attribute recognition, especially in unconstrained street images, is a challenging task for multimedia. Existing methods for multi-task clothing attribute prediction often ignore the relation between specific attributes and positions. However, the attribute response is always location-sensitive, i.e., different spatial locations have various contributions to attributes. Inspired by the locality of clothing attributes, in this paper, we introduce the attention mechanism to incorporate the impact of positions for clothing attribute prediction with only image-level annotations. However, the performance improvement is limited if we directly use the traditional spatial attention model for each task since it does not take the influence from other tasks into account. Instead, we propose a novel task-aware attention mechanism, which estimates the importance of each position across different tasks. We first evaluate a task attention network with an end-to-end multi-task clothing attribute learning architecture on the shop domain. And then, we employ curriculum learning strategy, which transfers the well-trained shop domain attribute knowledge to the street domain attribute prediction. Experiments are conducted on three clothing benchmarks, i.e., cross-domain clothing attribute dataset, woman clothing dataset, and man clothing dataset. The performance of attribute prediction demonstrates the superiority of the proposed task-aware attention mechanism over several state-of-the-art methods both in shop and street domains.

中文翻译：

服装属性预测的任务感知注意力模型

服装属性识别，尤其是在无约束的街道图像中，对于多媒体来说是一项具有挑战性的任务。现有的多任务服装属性预测方法往往忽略特定属性与位置之间的关系。然而，属性响应总是位置敏感的，即不同的空间位置对属性有不同的贡献。受服装属性局部性的启发，在本文中，我们引入了注意力机制，将位置对服装属性预测的影响与仅图像级注释结合起来。然而，如果我们对每个任务直接使用传统的空间注意力模型，性能提升是有限的，因为它没有考虑其他任务的影响。相反，我们提出了一种新颖的任务感知注意力机制，它估计每个职位在不同任务中的重要性。我们首先在商店领域评估具有端到端多任务服装属性学习架构的任务注意力网络。然后，我们采用课程学习策略，将训练有素的商店领域属性知识转移到街道领域属性预测中。在三个服装基准上进行了实验，即跨域服装属性数据集、女装数据集和男装数据集。属性预测的性能证明了所提出的任务感知注意机制在商店和街道领域的几种最先进方法的优越性。我们首先在商店领域评估具有端到端多任务服装属性学习架构的任务注意力网络。然后，我们采用课程学习策略，将训练有素的商店领域属性知识转移到街道领域属性预测中。在三个服装基准上进行了实验，即跨域服装属性数据集、女装数据集和男装数据集。属性预测的性能证明了所提出的任务感知注意机制在商店和街道领域中优于几种最先进的方法。我们首先在商店领域评估具有端到端多任务服装属性学习架构的任务注意力网络。然后，我们采用课程学习策略，将训练有素的商店领域属性知识转移到街道领域属性预测中。在三个服装基准上进行了实验，即跨域服装属性数据集、女装数据集和男装数据集。属性预测的性能证明了所提出的任务感知注意机制在商店和街道领域的几种最先进方法的优越性。在三个服装基准上进行了实验，即跨域服装属性数据集、女装数据集和男装数据集。属性预测的性能证明了所提出的任务感知注意机制在商店和街道领域中优于几种最先进的方法。在三个服装基准上进行了实验，即跨域服装属性数据集、女装数据集和男装数据集。属性预测的性能证明了所提出的任务感知注意机制在商店和街道领域的几种最先进方法的优越性。

更新日期：2020-04-01

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>