当前位置: X-MOL 学术IEEE Trans. Cybern. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Adaptive Maximum Entropy Graph-Guided Fast Locality Discriminant Analysis
IEEE Transactions on Cybernetics ( IF 11.8 ) Pub Date : 2021-11-24 , DOI: 10.1109/tcyb.2021.3125956
Feiping Nie 1 , Xiaowei Zhao 2 , Rong Wang 3 , Xuelong Li 3
Affiliation  

Linear discriminant analysis (LDA) aims to find a low-dimensional space in which data points in the same class are to be close to each other while keeping data points from different classes apart. To improve the robustness of LDA to non-Gaussian distribution data, most existing discriminant analysis methods extend LDA by approximating the underlying manifold of data. However, these methods suffer from the following problems: 1) local affinity or reconstruction coefficients are learned on the basis of the relationships of all data pairs, which would lead to a sharp increase in the amount of computation and 2) they learn the manifold information in the original space, ignoring the interference of the noise and redundant features. Motivated by these challenges, this article represents a novel discriminant analysis model, called fast and adaptive locality discriminant analysis (FALDA), to improve the efficiency and robustness. First, with the anchor-based strategy, a bipartite graph of each class is constructed to characterize the local structure of data. Since the number of anchor points is far less than that of data points, learning of fuzzy membership relationships between data points and anchor points within each class can save training time. Second, a maximum entropy regularization is introduced to control the uniformity of the weights of graphs and avoid the trivial solution. Third, the above relationships are updated adaptively in the process of dimensionality reduction, which can suppress the interference of the noise and redundant features. Fourth, the whitening constraint is imposed on the projection matrix to remove the relevance between features and restrict the total scatter of data in the subspace. Last but not the least, data with complex distribution can be explicitly divided into sub-blocks according to the learned anchor points (or subclass center points). We test our proposed method on synthetic data, benchmark datasets, and imbalanced datasets. Promising experimental results demonstrate the success of this novel model.

中文翻译:

自适应最大熵图引导的快速局部判别分析

线性判别分析 (LDA) 旨在找到一个低维空间,在该空间中,同一类的数据点要彼此靠近,同时保持不同类的数据点分开。为了提高 LDA 对非高斯分布数据的鲁棒性,大多数现有的判别分析方法通过近似数据的底层流形来扩展 LDA。然而,这些方法存在以下问题:1)局部亲和度或重建系数是在所有数据对的关系的基础上学习的,这将导致计算量急剧增加;2)它们学习流形信息在原始空间中,忽略噪声和冗余特征的干扰。在这些挑战的推动下,本文提出了一种新颖的判别分析模型,称为快速自适应局部判别分析(FALDA),以提高效率和鲁棒性。首先,使用基于锚点的策略,构建每个类的二分图来表征数据的局部结构。由于锚点的数量远远少于数据点的数量,学习每个类内数据点和锚点之间的模糊隶属关系可以节省训练时间。其次,引入最大熵正则化来控制图权重的均匀性,避免平凡解。第三,上述关系在降维过程中自适应更新,可以抑制噪声和冗余特征的干扰。第四,对投影矩阵施加白化约束以去除特征之间的相关性并限制子空间中数据的总分散。最后但并非最不重要的是,可以根据学习到的锚点(或子类中心点)将具有复杂分布的数据显式划分为子块。我们在合成数据、基准数据集和不平衡数据集上测试了我们提出的方法。有希望的实验结果证明了这种新模型的成功。
更新日期:2021-11-24
down
wechat
bug