顺序缩放稀疏因子回归

论文标题

顺序缩放稀疏因子回归

Sequential scaled sparse factor regression

论文作者

Zheng, Zemin, Li, Yang, Wu, Jie, Wang, Yuchen

论文摘要

多元响应和预测因素之间的大规模关联分析非常重要，这是社交媒体营销和危机管理在内的现代业务应用的例证。尽管有方法论的快速进步，但在一般的噪声协方差结构下，如何通过自由调整正则化参数来获得可扩展的估计器仍不清楚。在本文中，我们基于一种新的观点开发了一种称为顺序缩放稀疏因子回归（SES）的新方法，即可以通过常规的特征性分解恢复共同低级别和稀疏回归系数矩阵的问题。它结合了顺序估计的优势和缩放稀疏回归，从而共享了从两种方法继承的稀疏参数的可扩展性和无调属性。逐步的凸式公式，顺序因子回归框架和不敏感性使得对大数据应用程序高度可扩展。还提供了对高维多响应回归的新见解的全面理论依据。我们通过模拟研究和库存简短的兴趣数据分析证明了所提出方法的可伸缩性和有效性。

Large-scale association analysis between multivariate responses and predictors is of great practical importance, as exemplified by modern business applications including social media marketing and crisis management. Despite the rapid methodological advances, how to obtain scalable estimators with free tuning of the regularization parameters remains unclear under general noise covariance structures. In this paper, we develop a new methodology called sequential scaled sparse factor regression (SESS) based on a new viewpoint that the problem of recovering a jointly low-rank and sparse regression coefficient matrix can be decomposed into several univariate response sparse regressions through regular eigenvalue decomposition. It combines the strengths of sequential estimation and scaled sparse regression, thus sharing the scalability and the tuning free property for sparsity parameters inherited from the two approaches. The stepwise convex formulation, sequential factor regression framework, and tuning insensitiveness make SESS highly scalable for big data applications. Comprehensive theoretical justifications with new insights into high-dimensional multi-response regressions are also provided. We demonstrate the scalability and effectiveness of the proposed method by simulation studies and stock short interest data analysis.

下载PDF全文

下载文献需遵守相关版权规定

论文标题