Abstract
This paper addresses the issue of testing sphericity and identity of high-dimensional population covariance matrix when the data dimension exceeds the sample size. The central limit theorem of the first four moments of eigenvalues of sample covariance matrix is derived using random matrix theory for generally distributed populations. Further, some desirable asymptotic properties of the proposed test statistics are provided under the null hypothesis as data dimension and sample size both tend to infinity. Simulations show that the proposed tests have a greater power than existing methods for the spiked covariance model.
Similar content being viewed by others
References
Anderson, T.W. An Introduction to Multivariate Statistical Analysis (3th edition). John Wiley & Sons, 2003.
Bai, Z.D., Jiang, D.D., Yao, J.F., Zheng, S.R. Corrections to LRT on large-dimensional covariance matrix by RMT. The Annals of Statistics, 37: 3822–3840 (2009)
Bai, Z.D., Silverstein, J.W. CLT for linear spectral statistics of large-dimensional sample covariance matrices. The Annals of Probability, 32: 553–605 (2004)
Bai, Z.D., Silverstein, J.W. Spectral Analysis of Large Dimensional Random Matrices (2th edition). Springer, 2010.
Bianchi, P., Debbah, M., Maida, M., Najim, J. Performance of statistical tests for single-source detection using random matrix theory. IEEE Trans. Inf. Theory, 57: 2400–2419 (2011)
Birke, M., Dette, H. A note on testing the covariance matrix for large dimension. Stat. Probab. Lett., 74: 281–289 (2005)
Chen, S.X., Zhang, L.X., Zhong, P.S. Tests for high-dimensional covariance matrices. J. Am. Stat. Assoc., 105: 810–819 (2010)
Couillet, R., Debbah, M. Random Matrix Methods for Wireless Communications. Cambridge University Press, 2011.
Couillet, R., Hachem, W. Fluctuations of spiked random matrix models and failure diagnosis in sensor networks. IEEE Trans. Inf. Theory, 59: 509–525 (2013)
Dettling, M., Bhlmann, P. Boosting for tumor classification with gene expression data. Bioinformatics, 19: 1061–1069 (2003)
Dudoit, S., Fridlyand, J., Speed, T.P. Comparison of discrimination methods for the classification of tumors using gene expression data. J. Am. Stat. Assoc., 97: 77–87 (2002)
Fisher, T.J. On testing for an identity covariance matrix when the dimensionality equals or exceeds the sample size. J. Stat. Plan. Infer., 142: 312–326 (2012)
Fisher, T.J., Sun, X.Q., Gallagher, C.M. A new test for sphericity of the covariance matrix for high dimensional data. J. Multivar. Anal., 101: 2554–2570 (2010)
Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science, 286: 531–537 (1999)
Hu, J., Li, W., Liu, Z., Zhou, W. High-dimensional covariance matrices in elliptical distributions with application to spherical test. The Annals of Statistics, 47: 527–555 (2019)
John, S. Some optimal multivariate tests. Biometrika, 58: 123–127 (1971)
Johnstone, I.M. On the distribution of the largest eigenvalue in principal components analysis. The Annals of Statistics, 29: 295–327 (2001)
Ledoit, O., Wolf, M. Some hypothesis tests for the covariance matrix when the dimension is large compared to the sample size. The Annals of Statistics, 30: 1081–1102 (2002)
Nadakuditi, R., Edelman, A. Sample eigenvalue based detection of high-dimensional signals in white noise using relatively few samples. IEEE Trans. Signal. Process., 56: 2625–2638 (2008)
Nagao, H. On some test criteria for covariance matrix. The Annals of Statistics, 1: 700–709 (2002)
Pan, G.M., Zhou, W. Central limit theorem for signal-to-interference ratio of reduced rank linear receiver. The Annals of Applied Probability, 18: 1232–1270 (2008)
Silverstein, J.W. Strong convergence of the empirical distribution of eigenvalues of large dimensional random matrices. J. Multivar. Anal., 55: 331–339 (1995)
Srivastava, M.S. Some tests concerning the covariance matrix in high dimensional data. Journal of the Japan Statistical Society, 35: 251–272 (2005)
Srivastava, M.S. Some tests criteria for the covariance matrix with fewer observations than the dimension. Acta Et Commentationes Universitatis Tartuensis De Mathematica, 10: 77–93 (2006)
Srivastava, M.S., Kollo, T., von Rosen, D. Some tests for the covariance matrix with fewer observations than the dimension under non-normality. J. Multivar. Anal., 102: 1090–1103 (2011)
Srivastava, M.S., Yanagihara, H. Testing the equality of several covariance matrices with fewer observations than the dimension. J. Multivar. Anal., 101: 1319–1329 (2010)
Tian, X.T., Lu, Y.T., Li, W.M. A robust test for sphericity of high-dimensional covariance matrices. J. Multivar. Anal., 141: 217–227 (2015)
Torun, M., Akansu, A., Avellaneda, M. Portfolio risk in multiple frequencies. IEEE Signal Process. Mag., 28: 61–71 (2011)
Wang, C., Yang, J., Miao, B., Cao, L. Identity tests for high dimensional data using RMT. J. Multivar. Anal., 118: 128–137 (2013)
Wang, C., Yang, J., Miao, B., Cao, L. Testing identity of high-dimensional covariance matrix. J. Stat. Comput. Simul., 88: 2600–2611 (2018)
Wang, Q.W., Yao, J.F. On the sphericity test with large-dimensional observations. Electron. J. Stat., 7: 2164–2192 (2013)
Zheng, S.R. Central limit theorems for linear spectral statistics of large dimensional F-matrices. Annales de lInstitut Henri Poincar, Probabilits et Statistiques, 48: 444–476 (2012)
Author information
Authors and Affiliations
Corresponding author
Additional information
This paper is supported by the National Natural Science Foundation of China (Nos. 61374027, 11871357) and by the Sichuan Science and Technology Program (Nos. 2019YJ0122).
Rights and permissions
About this article
Cite this article
Yuan, Sc., Zhou, J., Pan, Jx. et al. Sphericity and Identity Test for High-dimensional Covariance Matrix Using Random Matrix Theory. Acta Math. Appl. Sin. Engl. Ser. 37, 214–231 (2021). https://doi.org/10.1007/s10255-021-1004-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10255-021-1004-1