Cross-modal subspace learning via kernel correlation maximization and discriminative structure-preserving
Multimedia Tools and Applications (IF 3.6), Pub Date: 2020-05-15, DOI: 10.1007/s11042-020-08989-1
Jun Yu, Xiao-Jun Wu

How to measure the distance between heterogeneous data is still an open problem. Many methods have been developed to learn a common subspace in which the similarity between different modalities can be computed directly. However, most existing works focus on learning a latent subspace without adequately preserving the semantic structural information, so they cannot achieve the desired results. In this paper, we propose a novel framework, termed Cross-modal subspace learning via Kernel correlation maximization and Discriminative structure-preserving (CKD), which addresses this problem in two ways. First, we construct a shared semantic graph so that the data of each modality preserve their semantic neighbor relationships. Second, we introduce the Hilbert-Schmidt Independence Criterion (HSIC) to ensure consistency between the feature similarity and the semantic similarity of samples. Our model not only captures the inter-modality correlation by maximizing the kernel correlation but also preserves the semantic structural information within each modality. Extensive experiments on three public datasets evaluate the proposed framework. The experimental results demonstrate that the proposed CKD is competitive with classic subspace learning methods.
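
To make the HSIC idea concrete, the following is a minimal sketch of the standard biased empirical HSIC estimator, tr(KHLH) / (n - 1)^2, applied to a feature kernel and a label-based semantic similarity matrix. It is not the CKD objective itself, which the abstract does not specify. The linear kernel, the label-equality similarity, the toy data, and all function names (`linear_kernel`, `label_similarity`, `hsic`) are assumptions made for illustration.

```python
import numpy as np


def linear_kernel(X):
    """Feature-similarity matrix K = X X^T (kernel choice is an assumption)."""
    return X @ X.T


def label_similarity(labels):
    """Semantic-similarity matrix: 1 if two samples share a label, else 0.

    This is one simple way to encode a semantic graph; the paper may use a
    different construction.
    """
    labels = np.asarray(labels)
    return (labels[:, None] == labels[None, :]).astype(float)


def hsic(K, L):
    """Biased empirical HSIC between two n x n kernel matrices."""
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2


# Toy data (hypothetical): 3 classes, 20 samples each, 5-dimensional features.
rng = np.random.default_rng(0)
labels = np.repeat([0, 1, 2], 20)
centers = 3.0 * rng.normal(size=(3, 5))

# Features that follow the class structure versus features that ignore it.
X_aligned = centers[labels] + rng.normal(size=(60, 5))
X_random = rng.normal(size=(60, 5))

S = label_similarity(labels)
hsic_aligned = hsic(linear_kernel(X_aligned), S)
hsic_random = hsic(linear_kernel(X_random), S)

print("HSIC (class-aligned features):", hsic_aligned)
print("HSIC (random features):       ", hsic_random)
```

In this toy setting, features whose similarity structure agrees with the labels yield a larger HSIC value than unrelated features, which is the kind of consistency between feature similarity and semantic similarity that the abstract describes.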




Updated: 2020-05-15