查询语义重建，以进行几次分段的背景

论文标题

查询语义重建，以进行几次分段的背景

Query Semantic Reconstruction for Background in Few-Shot Segmentation

论文作者

Guan, Haoyan, Spratling, Michael

论文摘要

很少有射击分段（FSS）旨在使用一些带注释的样本分割看不见的类。通常，从注释的支持图像中提取代表前景类的原型，并与代表查询图像中每个像素的特征匹配。但是，以这种方式学到的模型具有不足的歧视性，并且通常会产生误报：将背景像素作为前景错误分类。一些FSS方法试图通过使用支持图像中的背景来帮助识别查询图像中的背景。但是，这些图像的背景通常是完全不同的，因此，支持图像背景信息是无信息的。本文提出了一种方法QSR，该方法从查询图像本身中提取背景，因此可以更好地区分查询图像中的前景和背景特征。这是通过修改训练过程将原型与类标签相关联，包括培训数据和代表未知背景对象的潜在类别的已知类别的培训过程来实现。然后，此类信息用于从查询图像中提取背景原型。为了成功将原型与类标签联系起来，并提取能够预测图像背景区域的掩码的背景原型，诱导提取和使用前景原型的机械变得更加歧视。对Pascal-5i和CoCo-20i数据集的1-shot和5-shot FSS的实验表明，所提出的方法可将其应用于基线方法的性能显着改善。由于QSR仅在训练过程中运行，因此在测试过程中没有额外的计算复杂性产生这些改进的结果。

Few-shot segmentation (FSS) aims to segment unseen classes using a few annotated samples. Typically, a prototype representing the foreground class is extracted from annotated support image(s) and is matched to features representing each pixel in the query image. However, models learnt in this way are insufficiently discriminatory, and often produce false positives: misclassifying background pixels as foreground. Some FSS methods try to address this issue by using the background in the support image(s) to help identify the background in the query image. However, the backgrounds of theses images is often quite distinct, and hence, the support image background information is uninformative. This article proposes a method, QSR, that extracts the background from the query image itself, and as a result is better able to discriminate between foreground and background features in the query image. This is achieved by modifying the training process to associate prototypes with class labels including known classes from the training data and latent classes representing unknown background objects. This class information is then used to extract a background prototype from the query image. To successfully associate prototypes with class labels and extract a background prototype that is capable of predicting a mask for the background regions of the image, the machinery for extracting and using foreground prototypes is induced to become more discriminative between different classes. Experiments for both 1-shot and 5-shot FSS on both the PASCAL-5i and COCO-20i datasets demonstrate that the proposed method results in a significant improvement in performance for the baseline methods it is applied to. As QSR operates only during training, these improved results are produced with no extra computational complexity during testing.

下载PDF全文

下载文献需遵守相关版权规定

论文标题