论文标题
高维协方差矩阵的主要回归
Principal Regression for High Dimensional Covariance Matrices
论文作者
论文摘要
该手稿提出了一种以多个高维协方差矩阵作为结果进行通用线性回归的方法。提出模型参数是通过最大化伪样性来估计的。当数据具有较高的尺寸时,正常的似然函数将不适合,因为样品协方差矩阵是缺陷的。因此,引入了协方差矩阵的良好条件线性收缩估计量。使用多个协方差矩阵,提出收缩系数在整个矩阵中很常见。理论研究表明,在身份矩阵的所有线性组合和样品协方差矩阵中,提出的协方差矩阵估计器在渐近渐近均匀地渐近地实现了最佳最小二次损失。在规律性条件下,提出的模型参数估计器是一致的。通过模拟研究说明了所提出的方法的优越性能,而不是现有方法。该拟议方法实施了从阿尔茨海默氏病神经影像倡议中获得的静止状态功能磁共振成像研究,该方法确定了一个大脑网络,其中功能连通性与载脂蛋白E $ \ varepsilon $ 4显着相关,这是阿尔茨海默氏病的强大遗传标记。
This manuscript presents an approach to perform generalized linear regression with multiple high dimensional covariance matrices as the outcome. Model parameters are proposed to be estimated by maximizing a pseudo-likelihood. When the data are high dimensional, the normal likelihood function is ill-posed as the sample covariance matrix is rank-deficient. Thus, a well-conditioned linear shrinkage estimator of the covariance matrix is introduced. With multiple covariance matrices, the shrinkage coefficients are proposed to be common across matrices. Theoretical studies demonstrate that the proposed covariance matrix estimator is optimal achieving the uniformly minimum quadratic loss asymptotically among all linear combinations of the identity matrix and the sample covariance matrix. Under regularity conditions, the proposed estimator of the model parameters is consistent. The superior performance of the proposed approach over existing methods is illustrated through simulation studies. Implemented to a resting-state functional magnetic resonance imaging study acquired from the Alzheimer's Disease Neuroimaging Initiative, the proposed approach identified a brain network within which functional connectivity is significantly associated with Apolipoprotein E $\varepsilon$4, a strong genetic marker for Alzheimer's disease.