论文标题

Mahalanobis平衡:关于近似协变量平衡的多元视角

Mahalanobis balancing: a multivariate perspective on approximate covariate balancing

论文作者

Dai, Yimin, Yan, Ying

论文摘要

在过去的十年中,将各种确切的基于平衡的加权方法引入了因果推论文献。确切的平衡减轻了极端的重量和模型错误指定问题,这些问题可能会在实施反概率加权时产生。它通过在优化问题中施加平衡约束来消除协变量失衡。但是,当处理组和对照组的协变量之间或协变量高维时,优化问题可能是不可行的。最近,提出了近似平衡作为替代平衡框架,该框架通过使用不平等力矩约束来解决可行性问题。但是,当约束数量较大时,很难选择阈值参数。此外,力矩约束可能无法完全捕获协变量分布的差异。在本文中,我们提出了Mahalanobis的平衡,从多元角度来看,该平衡可以平衡协变量分布。我们使用二次约束来控制一个单个阈值参数的总体不平衡,可以通过简单的选择过程来调整。我们表明,Mahalanobis平衡的双重问题是一个基于L_2规范的正则回归问题,并与倾向分数模型建立了有趣的联系。我们进一步将Mahalanobis均衡到高维情况。我们得出了渐近特性,并与数值研究中现有的平衡方法进行了广泛的比较。

In the past decade, various exact balancing-based weighting methods were introduced to the causal inference literature. Exact balancing alleviates the extreme weight and model misspecification issues that may incur when one implements inverse probability weighting. It eliminates covariate imbalance by imposing balancing constraints in an optimization problem. The optimization problem can nevertheless be infeasible when there is bad overlap between the covariate distributions in the treated and control groups or when the covariates are high-dimensional. Recently, approximate balancing was proposed as an alternative balancing framework, which resolves the feasibility issue by using inequality moment constraints instead. However, it can be difficult to select the threshold parameters when the number of constraints is large. Moreover, moment constraints may not fully capture the discrepancy of covariate distributions. In this paper, we propose Mahalanobis balancing, which approximately balances covariate distributions from a multivariate perspective. We use a quadratic constraint to control overall imbalance with a single threshold parameter, which can be tuned by a simple selection procedure. We show that the dual problem of Mahalanobis balancing is an l_2 norm-based regularized regression problem, and establish interesting connection to propensity score models. We further generalize Mahalanobis balancing to the high-dimensional scenario. We derive asymptotic properties and make extensive comparisons with existing balancing methods in the numerical studies.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源