潜在特征的单个图像的超分辨率重建

论文标题

潜在特征的单个图像的超分辨率重建

Super-resolution Reconstruction of Single Image for Latent features

论文作者

Wang, Xin, Yan, Jing-Ke, Cai, Jing-Ye, Deng, Jian-Hua, Qin, Qin, Cheng, Yao

论文摘要

单片图像超分辨率（SISR）通常着重于将各种降级的低分辨率（LR）图像恢复为单个高分辨率（HR）图像。但是，在SISR任务期间，模型同时保持高质量和快速采样，同时保留细节和纹理特征的多样性通常是具有挑战性的。这一挑战可能导致诸如模型崩溃，重建的HR图像中缺乏丰富的细节和纹理特征以及模型采样的过度耗时。为了解决这些问题，本文提出了一个潜在的面向特征的扩散概率模型（LDDPM）。首先，我们设计了一个有条件的编码器，能够有效地编码LR图像，减少了模型图像重建的解决方案空间，从而提高了重建图像的质量。然后，我们采用了归一化流量和多模式对抗训练，从复杂的多模式分布中学习，以建模降解分布。这样做可以在最小数量的采样步骤中提高生成建模功能。我们提出的模型与主流数据集上现有的SISR方法的实验比较表明，我们的模型重建更现实的HR图像，并在多个评估指标上取得更好的性能，为解决SISR任务提供了新的视角。

Single-image super-resolution (SISR) typically focuses on restoring various degraded low-resolution (LR) images to a single high-resolution (HR) image. However, during SISR tasks, it is often challenging for models to simultaneously maintain high quality and rapid sampling while preserving diversity in details and texture features. This challenge can lead to issues such as model collapse, lack of rich details and texture features in the reconstructed HR images, and excessive time consumption for model sampling. To address these problems, this paper proposes a Latent Feature-oriented Diffusion Probability Model (LDDPM). First, we designed a conditional encoder capable of effectively encoding LR images, reducing the solution space for model image reconstruction and thereby improving the quality of the reconstructed images. We then employed a normalized flow and multimodal adversarial training, learning from complex multimodal distributions, to model the denoising distribution. Doing so boosts the generative modeling capabilities within a minimal number of sampling steps. Experimental comparisons of our proposed model with existing SISR methods on mainstream datasets demonstrate that our model reconstructs more realistic HR images and achieves better performance on multiple evaluation metrics, providing a fresh perspective for tackling SISR tasks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题