论文标题

深度平衡方法扩散模型

Deep Equilibrium Approaches to Diffusion Models

论文作者

Pokle, Ashwini, Geng, Zhengyang, Kolter, Zico

论文摘要

基于扩散的生成模型在生成高质量图像方面非常有效,生成的样品通常超过了其他模型在几种指标下产生的样品的质量。但是,这些模型的一个区别特征是它们通常需要长采样链来产生高保真图像。这不仅提出了抽样时间镜头的挑战,而且还源于通过这些链反向传播的固有困难,以完成诸如模型反转之类的任务,即大约找到生成已知图像的潜在状态。在本文中,我们通过不同的角度来研究扩散模型,即(深)平衡(DEQ)固定点模型的扩散模型。具体而言,我们扩展了最近的Denoisis扩散隐式模型(DDIM; Song等,2020),并将整个采样链建模为关节,多元固定点系统。该设置提供了扩散和平衡模型的优雅统一,并在1)单图像采样中显示出好处,因为它用平行的典型采样过程代替了完全典型的典型采样过程; 2)模型反转,我们可以在DEQ设置中利用快速梯度更快地找到生成给定图像的噪声。该方法也是正交的,因此与用于减少采样时间或改善模型反转的其他方法互补。我们在包括CIFAR10,Celeba和Lsun卧室和教堂在内的几个数据集中演示了方法的出色表现。

Diffusion-based generative models are extremely effective in generating high-quality images, with generated samples often surpassing the quality of those produced by other models under several metrics. One distinguishing feature of these models, however, is that they typically require long sampling chains to produce high-fidelity images. This presents a challenge not only from the lenses of sampling time, but also from the inherent difficulty in backpropagating through these chains in order to accomplish tasks such as model inversion, i.e. approximately finding latent states that generate known images. In this paper, we look at diffusion models through a different perspective, that of a (deep) equilibrium (DEQ) fixed point model. Specifically, we extend the recent denoising diffusion implicit model (DDIM; Song et al. 2020), and model the entire sampling chain as a joint, multivariate fixed point system. This setup provides an elegant unification of diffusion and equilibrium models, and shows benefits in 1) single image sampling, as it replaces the fully-serial typical sampling process with a parallel one; and 2) model inversion, where we can leverage fast gradients in the DEQ setting to much more quickly find the noise that generates a given image. The approach is also orthogonal and thus complementary to other methods used to reduce the sampling time, or improve model inversion. We demonstrate our method's strong performance across several datasets, including CIFAR10, CelebA, and LSUN Bedrooms and Churches.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源