当前位置: X-MOL 学术IEEE Trans. Image Process. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Hamming Embedding Sensitivity Guided Fusion Network for 3D Shape Representation.
IEEE Transactions on Image Processing ( IF 10.8 ) Pub Date : 2020-08-05 , DOI: 10.1109/tip.2020.3013138
Biao Gong , Chenggang Yan , Junjie Bai , Changqing Zou , Yue Gao

Three-dimensional multi-modal data are used to represent 3D objects in the real world in different ways. Features separately extracted from multimodality data are often poorly correlated. Recent solutions leveraging the attention mechanism to learn a joint-network for the fusion of multimodality features have weak generalization capability. In this paper, we propose a hamming embedding sensitivity network to address the problem of effectively fusing multimodality features. The proposed network called HamNet is the first end-to-end framework with the capacity to theoretically integrate data from all modalities with a unified architecture for 3D shape representation, which can be used for 3D shape retrieval and recognition. HamNet uses the feature concealment module to achieve effective deep feature fusion. The basic idea of the concealment module is to re-weight the features from each modality at an early stage with the hamming embedding of these modalities. The hamming embedding also provides an effective solution for fast retrieval tasks on a large scale dataset. We have evaluated the proposed method on the large-scale ModelNet40 dataset for the tasks of 3D shape classification, single modality and cross-modality retrieval. Comprehensive experiments and comparisons with state-of-the-art methods demonstrate that the proposed approach can achieve superior performance.

中文翻译:


用于 3D 形状表示的汉明嵌入灵敏度引导融合网络。



三维多模态数据用于以不同方式表示现实世界中的 3D 对象。从多模态数据中单独提取的特征通常相关性很差。最近利用注意力机制学习联合网络来融合多模态特征的解决方案泛化能力较弱。在本文中,我们提出了一种汉明嵌入敏感度网络来解决有效融合多模态特征的问题。所提出的名为 HamNet 的网络是第一个端到端框架,理论上能够将所有模态的数据与 3D 形状表示的统一架构集成,可用于 3D 形状检索和识别。 HamNet利用特征隐藏模块实现有效的深度特征融合。隐藏模块的基本思想是在早期阶段通过这些模态的汉明嵌入来重新加权每个模态的特征。汉明嵌入还为大规模数据集上的快速检索任务提供了有效的解决方案。我们在大规模 ModelNet40 数据集上针对 3D 形状分类、单模态和跨模态检索任务评估了所提出的方法。综合实验以及与最先进方法的比较表明,所提出的方法可以实现卓越的性能。
更新日期:2020-08-21
down
wechat
bug