Abstract
In light of the remarkable diversity of data, arises an interesting and challenging problem of their description and concise interpretation. In a nutshell, in the proposed description pursued in this study, we consider a framework of information granules. The study develops a general scheme composed of two functional phases: (i) clustering data and features forming segments of original data and delivering a meaningful partition of data, and (ii) development of information granules. In both phases, we discuss a suite of performance indexes quantifying the quality of segments of data and the resulting information granules. Along this line, discussed are collections of information granules and their mutual relationships. A series of publicly available data sets is used in the experiments—their granular signature is quantified, and the quality of these findings is analyzed.
Similar content being viewed by others
References
Sun, S., Shawe-Taylor, J., Mao, L.: PAC-Bayes analysis of multi-view learning. Inf. Fusion 35, 117–131 (2017)
Jiang, B., Qiu, F., Wang, L.: Multi-view clustering via simultaneous weighting on views and features. Appl. Soft Comput. 47, 304–315 (2016)
Pontes, B., Giráldez, R., Aguilar-Ruiz, J.S.: Biclustering on expression data: a review. J. Biomed. Inform. 57, 163–180 (2015)
Zong, L., Zhang, X., Yu, H., Zhao, Q., Ding, F.: Local linear neighbor reconstruction for multi-view data. Pattern Recognit. Lett. 84, 56–62 (2016)
Yin, J., Sun, S.: Multiview uncorrelated locality preserving projection. IEEE Trans. Neural Netw. Learn. Syst. (2019). https://doi.org/10.1109/TNNLS.2019.2944664
Henriques, R., Antunes, C., Madeira, S.C.: A structured view on pattern mining-based biclustering. Pattern Recognit. 48, 3941–3958 (2015)
Bezdek, J.C., Ehrlich, R., Full, W.: FCM: the fuzzy c-means clustering algorithm. Comput. Geosci. 10, 191–203 (1984)
Lunwen, W.: Study of granular analysis in clustering. Comput. Eng. Appl. 5, 29–31 (2006)
Liberti, L., Lavor, C., Maculan, N., Mucherino, A.: Euclidean distance geometry and applications. SIAM Rev. 56, 3–69 (2014)
Dattorro, J.: Convex optimization & euclidean distance geometry. Lulu.com (2010)
Zadeh, L.A.: Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic. Fuzzy Sets Syst. 90, 111–127 (1997)
de Sa, J., Silva, L., Santos, J., Alexandre, L.: Minimum error entropy classification. In Studies in Computational Intelligence. Springer (2013)
Wang, X., Pedrycz, W., Gacek, A., Liu, X.: From numeric data to information granules: a design through clustering and the principle of justifiable granularity. Knowl. Based Syst. 101, 100–113 (2016)
Pedrycz, W., Homenda, W.: Building the fundamentals of granular computing: a principle of justifiable granularity. Appl. Soft Comput. 13, 4209–4218 (2013)
Pedrycz, W., Al-Hmouz, R., Morfeq, A., Balamash, A.: The design of free structure granular mappings: the use of the principle of justifiable granularity. IEEE Trans. Cybern. 43, 2105–2113 (2013)
Pedrycz, W.: The principle of justifiable granularity and an optimization of information granularity allocation as fundamentals of granular computing. J. Inf. Process. Syst. 7, 397–412 (2011)
Balamash, A., Pedrycz, W., Al-Hmouz, R., Morfeq, A.: Granular classifiers and their design through refinement of information granules. Soft. Comput. 21, 2745–2759 (2017)
Machine Learning Data (MLData). https://www.mldata.io/dataset-details/gender_voice/. Accessed 1 Sept 2019
UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/index.php. Accessed 1 Sept 2019
Acknowledgements
This project was funded by the Deanship of Scientific Research (DSR), King Abdulaziz University, Jeddah, Saudi Arabia, under Grant No. (KEP-5-135-39). The authors, therefore, acknowledge with thanks DSR technical and financial support.
Author information
Authors and Affiliations
Corresponding author
Appendix A
Appendix A
Used symbols
Symbol | Description |
---|---|
Di | Data cluster i |
Fj | Feature cluster j |
r | Number of feature clusters |
c | Number of data clusters |
Q | FCM objective variable |
xk | Data point k |
zj | Feature j |
N | Total number of data points |
n | Total number of features |
vi | Data cluster i prototype |
m | Fuzzification coefficient |
uik | The membership value of a data point xk to the data cluster i |
gij | The membership value of a feature zj to the feature cluster i |
Vij | Reconstruction error produced for (Di, Fj) |
\(\left\| \cdot \right\|_{{F_{j} }}\) | Distance completed for features forming Fj |
ρij | The probability class j exists in information granule i |
vij | Data cluster i prototype computed by averaging cluster data points just for features forming Fj |
h | Entropy |
Ci | Vagueness of the ith information granule |
σiy | The variance of the output values of data cluster i |
\(\bar{y}_{i}\) | The average of the output values of data cluster i |
Ri | |
Nij | Number of data points in information granule Gij ≡ (Di, Fj) |
cov | Coverage |
sp | Specify |
\(\hat{y}\) | Predicted y value |
\(\rho\) | Predicted class |
Rights and permissions
About this article
Cite this article
Balamash, A., Pedrycz, W., Al-Hmouz, R. et al. Data Description Through Information Granules: A Multiview Perspective. Int. J. Fuzzy Syst. 22, 1731–1747 (2020). https://doi.org/10.1007/s40815-020-00903-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40815-020-00903-z