论文标题
基于条件分数的数据生成的可能性得分匹配
Denoising Likelihood Score Matching for Conditional Score-based Data Generation
论文作者
论文摘要
许多现有的基于条件分数的数据生成方法利用贝叶斯定理将对数后密度的梯度分解为分数的混合物。这些方法促进了条件分数模型的训练程序,因为可以使用分数模型和分类器分别估算得分的混合物。但是,我们的分析表明,在这些方法中,分类器的训练目标可能导致严重的分数不匹配问题,这与估计的分数与真实分数偏离的情况相对应。这样的问题导致样品在扩散过程中被偏差的分数误导,从而导致采样质量降解。为了解决它,我们制定了一个新颖的训练目标,称为denoising似然得分匹配(DLSM)损失,使分类器匹配真实对数可能性密度的梯度。我们的实验证据表明,就几种关键评估指标而言,该提出的方法在CIFAR-10和CIFAR-100基准方面都优于先前的方法。因此,我们得出的结论是,通过采用DLSM,可以对条件分数进行准确的建模,并且得分不匹配的问题的效果得到缓解。
Many existing conditional score-based data generation methods utilize Bayes' theorem to decompose the gradients of a log posterior density into a mixture of scores. These methods facilitate the training procedure of conditional score models, as a mixture of scores can be separately estimated using a score model and a classifier. However, our analysis indicates that the training objectives for the classifier in these methods may lead to a serious score mismatch issue, which corresponds to the situation that the estimated scores deviate from the true ones. Such an issue causes the samples to be misled by the deviated scores during the diffusion process, resulting in a degraded sampling quality. To resolve it, we formulate a novel training objective, called Denoising Likelihood Score Matching (DLSM) loss, for the classifier to match the gradients of the true log likelihood density. Our experimental evidence shows that the proposed method outperforms the previous methods on both Cifar-10 and Cifar-100 benchmarks noticeably in terms of several key evaluation metrics. We thus conclude that, by adopting DLSM, the conditional scores can be accurately modeled, and the effect of the score mismatch issue is alleviated.