当前位置: X-MOL 学术Neurocomputing › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
MVSN: A Multi-view stack network for human parsing
Neurocomputing ( IF 6 ) Pub Date : 2021-09-03 , DOI: 10.1016/j.neucom.2021.08.124
Zhuo Su 1 , Minshi Chen 1 , Enbo Huang 1 , Ge Lin 1 , Fan Zhou 1
Affiliation  

The human parsing task is dedicated to allocating pixel-level fine-grained semantic labels. However, recent solutions are limited to fully utilize the prior information (poses and edges) and the potential information of the image data, remaining problems with ambiguous boundaries, incomplete human parts, and redundant labels. To solve the above problems, we propose a novel Multi-view Stack Network (MVSN), which is constructed by three stacks with multiple views of features included parts, edges, and pre-segmentation. Meanwhile, a channel correlator is developed to acquire the correlation between local and global information better. Comprehensive experiments and corresponding results on three public datasets show that the proposed MVSN performs favorably against the state-of-the-art methods.



中文翻译:

MVSN:用于人类解析的多视图堆栈网络

人工解析任务致力于分配像素级细粒度语义标签。然而,最近的解决方案仅限于充分利用图像数据的先验信息(姿势和边缘)和潜在信息,存在边界模糊、人体部分不完整和标签冗余的问题。为了解决上述问题,我们提出了一种新颖的多视图堆栈网络(MVSN),它由三个堆栈构成,具有多个特征视图,包括部分、边缘和预分割。同时,开发了信道相关器以更好地获取局部和全局信息之间的相关性。在三个公共数据集上的综合实验和相应结果表明,所提出的 MVSN 与最先进的方法相比表现良好。

更新日期:2021-09-23
down
wechat
bug