论文标题
与自动编码器无监督的概念漂移学习
Unsupervised Unlearning of Concept Drift with Autoencoders
论文作者
论文摘要
概念漂移是指影响未来样本数据流的数据分布的变化。因此,在数据流上运行的学习模型可能会过时,需要昂贵且艰难的调整,例如再培训或适应。现有方法通常实现局部概念漂移适应方案,其中使用模型的增量学习,或者在漂移检测机构触发警报时完全重新训练。本文提出了一种替代方法,其中根据自动编码器介绍了全球层面的无监督和模型概念漂移适应方法。具体而言,所提出的方法旨在``''Unrearn'概念漂移而无需重新训练或调整任何在数据上运行的学习模型。在两个应用领域进行了广泛的实验评估。我们考虑了一个现实的水分配网络,该网络与30多个模型开放,我们从中创建了200个模拟数据集 /方案。我们进一步考虑了与图像相关的任务,以证明我们方法的有效性。
Concept drift refers to a change in the data distribution affecting the data stream of future samples. Consequently, learning models operating on the data stream might become obsolete, and need costly and difficult adjustments such as retraining or adaptation. Existing methods usually implement a local concept drift adaptation scheme, where either incremental learning of the models is used, or the models are completely retrained when a drift detection mechanism triggers an alarm. This paper proposes an alternative approach in which an unsupervised and model-agnostic concept drift adaptation method at the global level is introduced, based on autoencoders. Specifically, the proposed method aims to ``unlearn'' the concept drift without having to retrain or adapt any of the learning models operating on the data. An extensive experimental evaluation is conducted in two application domains. We consider a realistic water distribution network with more than 30 models in-place, from which we create 200 simulated data sets / scenarios. We further consider an image-related task to demonstrate the effectiveness of our method.