对域适应的简要审查

论文标题

对域适应的简要审查

A Brief Review of Domain Adaptation

论文作者

Farahani, Abolfazl, Voghoei, Sahar, Rasheed, Khaled, Arabnia, Hamid R.

论文摘要

经典的机器学习假设培训和测试集来自相同的分布。因此，从标记的培训数据中学到的模型预计将在测试数据上表现良好。但是，由于许多因素，例如从不同来源收集培训和测试集，或者由于数据随时间变化而进行过时的培训集，因此该假设可能并不总是存在于培训和测试数据源于不同分布的现实应用程序中。在这种情况下，域分布将存在差异，并且在新数据集中天真地应用训练有素的模型可能会导致性能下降。域的适应性是机器学习中的一个子场，旨在通过对齐域之间的差异来应对这些类型的问题，从而可以将受过训练的模型推广到感兴趣的领域。本文着重于无监督的域适应性，其中标签仅在源域中可用。它从不同的角度解决了域适应的分类。此外，它提出了一些成功的浅域适应方法，旨在解决领域适应问题。

Classical machine learning assumes that the training and test sets come from the same distributions. Therefore, a model learned from the labeled training data is expected to perform well on the test data. However, This assumption may not always hold in real-world applications where the training and the test data fall from different distributions, due to many factors, e.g., collecting the training and test sets from different sources, or having an out-dated training set due to the change of data over time. In this case, there would be a discrepancy across domain distributions, and naively applying the trained model on the new dataset may cause degradation in the performance. Domain adaptation is a sub-field within machine learning that aims to cope with these types of problems by aligning the disparity between domains such that the trained model can be generalized into the domain of interest. This paper focuses on unsupervised domain adaptation, where the labels are only available in the source domain. It addresses the categorization of domain adaptation from different viewpoints. Besides, It presents some successful shallow and deep domain adaptation approaches that aim to deal with domain adaptation problems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题