Skip to main content
Log in

An Efficient Algorithm Combining Spectral Clustering with Feature Selection

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

Traditional clustering algorithms have some limitations, which are sensitive to noise and mostly applicable to convex data sets. To solve these problems, the paper proposes a novel algorithm combining spectral clustering with feature selection. Specifically, the loss item is marked with a root that can reduce the deviation value then improve the robustness of the model. And in the algorithm optimization, there is one parameter is represented by other known parameters, which can reduce the time of parameter adjustment. Then, the regular term \({{\ell }_{2,p}}\text {-norm}\) is applied to reduce the influence of noise and redundant features and prevent the model from overfitting. Finally, Laplace matrix is constructed by kNN algorithm which is used to learn subspace and to preserve the local structure among samples, and the data after dimension reduction is used to spectral clustering. Experimental analysis on 10 benchmark datasets show that the proposed algorithm is more outperformed than the algorithms of the state-of-the-art.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  1. Chaudhari S, Cabric D (2017) Unsupervised frequency clustering algorithm for null space estimation in wideband spectrum sharing networks. In: 2017 IEEE global conference on signal and information processing, pp 224–228

  2. Elhamifar E, Vidal R (2013) Sparse subspace clustering: algorithm, theory, and applications. IEEE Trans Pattern Anal Mach Intell 35(11):2765–2781

    Article  Google Scholar 

  3. Gao L, Li X, Song J, Shen HT (2019) Hierarchical LSTMS with adaptive attention for visual captioning. IEEE Trans Pattern Anal Mach Intell

  4. Guo G, Wang H, Bell D, Bi Y, Greer K (2003) Knn model-based approach in classification. In: OTM confederated international conferences “on the move to meaningful internet systems”, pp 986–996

  5. Guo Y, Wu Z, Shen D (2019) Learning longitudinal classification-regression model for infant hippocampus segmentation. Neurocomputing

  6. Hand DJ (2007) Principles of data mining. Drug Saf 30(7):621–622

    Article  Google Scholar 

  7. Hao S, Zhou Y, Guo Y (2020) A brief survey on semantic segmentation with deep learning. Neurocomputing

  8. Hartigan JA, Wong MA (1979) Algorithm as 136: a k-means clustering algorithm. J R Stat Soc Ser C (Applied Statistics) 28(1):100–108

    MATH  Google Scholar 

  9. Hou C, Nie F, Li X, Yi D, Wu Y (2013) Joint embedding learning and sparse regression: a framework for unsupervised feature selection. IEEE Trans Cybern 44(6):793–804

    Google Scholar 

  10. Hu R, Zhu X, Zhu Y, Gan J (2019) Robust SVM with adaptive graph learning. World Wide Web. https://doi.org/10.1007/s11280-019-00766-x

  11. Huang D, Wang CD, Wu J, Lai JH, Kwoh CK (2019) Ultra-scalable spectral clustering and ensemble clustering. IEEE Trans Knowl Data Eng

  12. Jain AK, Murty MN, Flynn PJ (1999) Data clustering:a review. ACM Comput Surv 31(3):264–323

    Article  Google Scholar 

  13. Ji Y, Zhan Y, Yang Y, Xu X, Shen F, Shen HT (2019) A context knowledge map guided coarse-to-fine action recognition. IEEE Trans Image Process

  14. Jia H, Ding S, Ma H, Xing W (2014) Spectral clustering with neighborhood attribute reduction based on information entropy. J Comput 9(6):1316–1324

    Article  Google Scholar 

  15. Jia H, Ding S, Xu X, Nie R (2014) The latest research progress on spectral clustering. Neural Comput Appl 24(7–8):1477–1486

    Article  Google Scholar 

  16. Kannan SR, Sathya A, Sathya A, Sathya A (2011) Robust kernel FCM in segmentation of breast medical images. Expert Syst Appl Int J 38(4):4382–4389

    Article  Google Scholar 

  17. Li W, Godzik A (2006) Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22(13):1658–1659

    Article  Google Scholar 

  18. Li Z, Zhang Z, Qin J, Zhang Z, Shao L (2019) Discriminative fisher embedding dictionary learning algorithm for object recognition. IEEE Trans Neural Netw Learn Syst

  19. Liu Y, Jiao L, Shang F (2013) An efficient matrix factorization based low-rank representation for subspace clustering. Pattern Recognit 46(1):284–292

    Article  Google Scholar 

  20. Lucinska M (2014) A spectral clustering algorithm based on eigenvector localization. In: International conference on artificial intelligence and soft computing. Springer, Berlin, pp 749–759

  21. Malik J, Belongie S, Leung T, Shi J (2001) Contour and texture analysis for image segmentation. Int J Comput Vis 43(1):7–27

    Article  Google Scholar 

  22. Nie F, Wang CL, Li X (2019) K-multiple-means: a multiple-means clustering method with specified k clusters. In: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery and data mining, pp 959–967

  23. Nie F, Wang X, Jordan MI, Huang H (2016) The constrained laplacian rank algorithm for graph-based clustering. In: Proceedings of the thirtieth conference on artificial intelligence, pp 1969–1976

  24. Nie F, Zhang R, Li X (2017) A generalized power iteration method for solving quadratic problem on the stiefel manifold. Sci China Inf Sci 60(11):112101

    Article  MathSciNet  Google Scholar 

  25. Shen HT, Liu L, Yang Y, Xu X, Huang Z, Shen F, Hong R (2020) Exploiting subspace relation in semantic labels for cross-modal hashing. IEEE Trans Knowl Data Eng

  26. Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905

    Article  Google Scholar 

  27. Tremblay N, Puy G, Gribonval R, Vandergheynst P (2016) Compressive spectral clustering. In: International conference on machine learning, pp 1002–1011

  28. Wang B, Yang Y, Xu X, Hanjalic A, Shen HT (2017) Adversarial cross-modal retrieval. In: Proceedings of the 25th ACM international conference on multimedia, pp 154–162

  29. Weiss Y (1999) Segmentation using eigenvectors: a unifying view. In: Proceedings of the seventh IEEE international conference on computer vision, vol 2. IEEE, pp 975–982

  30. Xie G, Zhang Z, Liu L, Zhu F, Zhang X, Shao L, Li X (2019) SRSC: selective, robust, and supervised constrained feature representation for image classification. IEEE Trans Neural Netw Learn Syst (2019)

  31. Yang Y, Duan Y, Wang X, Huang Z, Xie N, Shen HT (2018) Hierarchical multi-clue modelling for poi popularity prediction with heterogeneous tourist information. IEEE Trans Knowl Data Eng 31(4):757–768

    Article  Google Scholar 

  32. Yang Y, Ma Z, Yang Y, Nie F, Shen HT (2014) Multitask spectral clustering by exploring intertask correlation. IEEE Trans Cybern 45(5):1083–1094

    Article  Google Scholar 

  33. Yang Y, Rutayisire T, Lin C, Li T, Teng F (2013) An improved cop-kmeans clustering for solving constraint violation based on mapreduce framework. Fundam Inform 126(4):301–318

    Article  MathSciNet  Google Scholar 

  34. Yang Y, Shen HT, Ma Z, Huang Z, Zhou X (2011) L2, 1 norm regularized discriminative feature selection for unsupervised learning. In: Proceedings of the 22nd international joint conference on artificial intelligence, pp 1589–1594

  35. Zhang J, Yu J, Wan J, Zeng Z (2015) L2, 1 norm regularized fisher criterion for optimal feature selection. Neurocomputing 166:455–463

    Article  Google Scholar 

  36. Zhang Y, Jin J, Qing X, Wang B, Wang X (2012) Lasso based stimulus frequency recognition model for ssvep bcis. Biomed Signal Process Control 7(2):104–111

    Article  Google Scholar 

  37. Zhang Y, Zhao Q, Jin J, Wang X, Cichocki A (2012) A novel BCI based on erp components sensitive to configural processing of human faces. J Neural Eng 9(2):26018

    Article  Google Scholar 

  38. Zhang Z, Lai Z, Huang Z, Wong WK, Xie GS, Liu L, Shao L (2019) Scalable supervised asymmetric hashing with semantic and latent factor embedding. IEEE Trans Image Process 28(10):4803–4818

    Article  MathSciNet  Google Scholar 

  39. Zhang Z, Liu L, Shen F, Shen HT, Shao L (2018) Binary multi-view clustering. IEEE Trans Pattern Anal Mach Intell 41(7):1774–1782

    Article  Google Scholar 

  40. Zhou Y, Tian L, Zhu C, Jin X, Sun Y (2019) Video coding optimization for virtual reality 360-degree source. IEEE J Sel Topi Signal Process

  41. Zhu X, Gan J, Lu G, Li J, Zhang S (2019) Spectral clustering via half-quadratic optimization. World Wide Web, pp 1–20

  42. Zhu X, Yang J, Zhang C, Zhang S (2019) Efficient utilization of missing data in cost-sensitive learning. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2019.2956530

  43. Zhu X, Zhang S, Zhu Y, Zheng W (2020) Self-weighted multi-view fuzzy clustering. In: ACM transactions on knowledge discovery from data (TKDD)

  44. Zhu X, Zhu Y, Zheng W (2019) Spectral rotation for deep one-step clustering. Pattern Recognit. https://doi.org/10.1016/j.patcog.2019.107175

Download references

Acknowledgements

This work is partially supported by the China Key Research Program (Grant No. 2016YFB1000905); the Key Program of the National Natural Science Foundation of China (Grant No. 61836016); the Natural Science Foundation of China (Grants Nos. 61876046, 61573270, 81701780, 61672177 and 61972177); the Project of Guangxi Science and Technology (GuiKeAD17195062); the Guangxi Natural Science Foundation (Grant No. 2017GXNSFBA198221); the Guangxi Collaborative Innovation Center of Multi-Source Information Integration and Intelligent Processing; the Guangxi High Institutions Program of Introducing 100 High-Level Overseas Talents; the Research Fund of Guangxi Key Lab of Multisource Information Mining & Security (18-A-01-01); and 2019 basic scientific research capability enhancement project for middle-aged teachers in guangxi university (2019KY0062); Innovation Project of Guangxi Graduate Education (YCBZ2020038, JGY2020026).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guoqiu Wen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Luo, Q., Wen, G., Zhang, L. et al. An Efficient Algorithm Combining Spectral Clustering with Feature Selection. Neural Process Lett 52, 1913–1925 (2020). https://doi.org/10.1007/s11063-020-10297-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-020-10297-6

Keywords

Navigation