域的跨性别双重一致性学习，用于域广义语义细分

论文标题

域的跨性别双重一致性学习，用于域广义语义细分

Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation

论文作者

Zhao, Yuyang, Zhong, Zhun, Zhao, Na, Sebe, Nicu, Lee, Gim Hee

论文摘要

在本文中，我们研究了合成到现实域的通用语义分割的任务，该任务旨在学习一个仅使用合成数据的现实场景的强大模型。合成数据和现实世界数据之间的大域移动，包括有限的源环境变化以及合成和现实世界数据之间的较大分布差距，极大地阻碍了看不见的现实现实场景中的模型性能。在这项工作中，我们建议使用样式挂钩的双重一致性学习（SHADE）框架来处理此类域转移。具体而言，阴影是基于两个一致性约束（样式一致性（SC）和回顾一致性（RC）构建的。 SC丰富了来源情况，并鼓励模型在样式多样的样本中学习一致的表示。 RC利用现实世界的知识来防止模型过度拟合到合成数据，因此在很大程度上使综合模型和现实世界模型之间的表示一致。此外，我们提出了一个新颖的样式幻觉模块（SHM），以生成对一致性学习至关重要的样式变化样本。 SHM从源分布中选择基础样式，从而使模型能够在训练过程中动态生成各种和现实的样本。实验表明，我们的阴影在单个和多源设置上的三个现实世界数据集的平均MIOU的平均MIOU的平均MIOU平均胜过最先进的方法，并胜过最先进的方法。

In this paper, we study the task of synthetic-to-real domain generalized semantic segmentation, which aims to learn a model that is robust to unseen real-world scenes using only synthetic data. The large domain shift between synthetic and real-world data, including the limited source environmental variations and the large distribution gap between synthetic and real-world data, significantly hinders the model performance on unseen real-world scenes. In this work, we propose the Style-HAllucinated Dual consistEncy learning (SHADE) framework to handle such domain shift. Specifically, SHADE is constructed based on two consistency constraints, Style Consistency (SC) and Retrospection Consistency (RC). SC enriches the source situations and encourages the model to learn consistent representation across style-diversified samples. RC leverages real-world knowledge to prevent the model from overfitting to synthetic data and thus largely keeps the representation consistent between the synthetic and real-world models. Furthermore, we present a novel style hallucination module (SHM) to generate style-diversified samples that are essential to consistency learning. SHM selects basis styles from the source distribution, enabling the model to dynamically generate diverse and realistic samples during training. Experiments show that our SHADE yields significant improvement and outperforms state-of-the-art methods by 5.05% and 8.35% on the average mIoU of three real-world datasets on single- and multi-source settings, respectively.

下载PDF全文

下载文献需遵守相关版权规定

论文标题