学习适应跨域的特定班级特征以进行语义分割

论文标题

学习适应跨域的特定班级特征以进行语义分割

Learning to adapt class-specific features across domains for semantic segmentation

论文作者

Menta, Mikel, Romero, Adriana, van de Weijer, Joost

论文摘要

无监督的域适应性的最新进展显示了对抗性训练以适应跨域的特征的有效性，使神经网络具有在目标域进行测试的能力，而无需在该领域中进行任何训练注释。绝大多数现有域的适应模型都取决于图像翻译网络，这些网络通常包含大量域特异性参数。此外，特征适应步骤通常在全球范围内，在粗糙的水平上进行，阻碍了其对语义分割等任务的适用性，在这种情况下，细节对于提供敏锐的结果至关重要。在本文中，我们提出了一种新颖的体系结构，该架构学会通过考虑每个类信息来适应跨领域的功能。为此，我们设计了一个有条件的像素歧视器网络，其输出在分割掩码上进行条件。此外，随着图像翻译的最新进展，我们采用了最近引入的Stargan体系结构作为图像翻译骨干，因为它能够通过单个发电机网络跨多个域进行翻译。旨在评估拟议方法的有效性的细分任务的初步结果突出了模型的潜力，从而改善了强大的基准和替代设计。

Recent advances in unsupervised domain adaptation have shown the effectiveness of adversarial training to adapt features across domains, endowing neural networks with the capability of being tested on a target domain without requiring any training annotations in this domain. The great majority of existing domain adaptation models rely on image translation networks, which often contain a huge amount of domain-specific parameters. Additionally, the feature adaptation step often happens globally, at a coarse level, hindering its applicability to tasks such as semantic segmentation, where details are of crucial importance to provide sharp results. In this thesis, we present a novel architecture, which learns to adapt features across domains by taking into account per class information. To that aim, we design a conditional pixel-wise discriminator network, whose output is conditioned on the segmentation masks. Moreover, following recent advances in image translation, we adopt the recently introduced StarGAN architecture as image translation backbone, since it is able to perform translations across multiple domains by means of a single generator network. Preliminary results on a segmentation task designed to assess the effectiveness of the proposed approach highlight the potential of the model, improving upon strong baselines and alternative designs.

下载PDF全文

下载文献需遵守相关版权规定

论文标题