深度神经网络对低维流形的双重范围内学习

论文标题

深度神经网络对低维流形的双重范围内学习

Doubly Robust Off-Policy Learning on Low-Dimensional Manifolds by Deep Neural Networks

论文作者

Chen, Minshuo, Liu, Hao, Liao, Wenjing, Zhao, Tuo

论文摘要

因果推论探讨了行动与协变量集中的奖励之间的因果关系。最近，深度学习在因果推论方面取得了出色的表现，但是现有的统计理论不能很好地解释这种经验成功，尤其是当协变量高维时。因果推论中的大多数理论结果都是渐近性的，遭受维度的诅咒，仅在有限行动的情况下起作用。为了弥合理论和实践之间的差距，本文通过深层神经网络进行了双重稳健的非政策学习。当协变量位于低维歧管上时，我们证明了非杂词遗憾界限，该边界的汇聚速率很快取决于歧管的内在维度。我们的结果涵盖了有限和连续性方案。我们的理论表明，深层神经网络适应协变量的低维几何结构，并部分解释了深度学习因果推论的成功。

Causal inference explores the causation between actions and the consequent rewards on a covariate set. Recently deep learning has achieved a remarkable performance in causal inference, but existing statistical theories cannot well explain such an empirical success, especially when the covariates are high-dimensional. Most theoretical results in causal inference are asymptotic, suffer from the curse of dimensionality, and only work for the finite-action scenario. To bridge such a gap between theory and practice, this paper studies doubly robust off-policy learning by deep neural networks. When the covariates lie on a low-dimensional manifold, we prove nonasymptotic regret bounds, which converge at a fast rate depending on the intrinsic dimension of the manifold. Our results cover both the finite- and continuous-action scenarios. Our theory shows that deep neural networks are adaptive to the low-dimensional geometric structures of the covariates, and partially explains the success of deep learning for causal inference.

下载PDF全文

下载文献需遵守相关版权规定

论文标题