Elsevier

Journal of Multivariate Analysis

Volume 143, January 2016, Pages 107-126
Journal of Multivariate Analysis

Inference for high-dimensional differential correlation matrices

https://doi.org/10.1016/j.jmva.2015.08.019Get rights and content
Under an Elsevier user license
open archive

Abstract

Motivated by differential co-expression analysis in genomics, we consider in this paper estimation and testing of high-dimensional differential correlation matrices. An adaptive thresholding procedure is introduced and theoretical guarantees are given. Minimax rate of convergence is established and the proposed estimator is shown to be adaptively rate-optimal over collections of paired correlation matrices with approximately sparse differences. Simulation results show that the procedure significantly outperforms two other natural methods that are based on separate estimation of the individual correlation matrices. The procedure is also illustrated through an analysis of a breast cancer dataset, which provides evidence at the gene co-expression level that several genes, of which a subset has been previously verified, are associated with the breast cancer. Hypothesis testing on the differential correlation matrices is also considered. A test, which is particularly well suited for testing against sparse alternatives, is introduced. In addition, other related problems, including estimation of a single sparse correlation matrix, estimation of the differential covariance matrices, and estimation of the differential cross-correlation matrices, are also discussed.

AMS 2010 subject classification

primary
62H12
secondary
62F12

Keywords

Adaptive thresholding
Covariance matrix
Differential co-expression analysis
Differential correlation matrix
Optimal rate of convergence
Sparse correlation matrix
Thresholding

Cited by (0)

The research was supported in part by National Science Foundation Grants DMS-1208982 and DMS-1403708, and National Institutes of Health Grant R01 CA127334.