论文标题
fase-al-适应合奏的快速自适应堆叠以支持主动学习
Fase-AL -- Adaptation of Fast Adaptive Stacking of Ensembles for Supporting Active Learning
论文作者
论文摘要
近年来,已经对数据流的分类算法进行了广泛的研究。但是,许多这些算法都是为了监督学习而设计的,需要标记为实例。然而,数据的标签是昂贵且耗时的。因此,已经提出了替代学习范例来降低标签过程的成本,而不会大大损失模型性能。主动学习是这些范式之一,其主要目的是建立分类模型,该模型要求最低的标记示例数量达到足够的准确性。因此,这项工作提出了fase-al算法,该算法通过使用主动学习诱导非标记实例诱导分类模型。 fase-al基于算法的算法快速自适应堆叠(fase)。 FASE是一种集合算法,当输入数据流具有概念漂移时检测和调整模型。将FASE-AL与文献中发现的四种不同主动学习策略进行了比较。实验中使用了真实和合成数据库。该算法在正确分类的实例的百分比方面取得了有希望的结果。
Classification algorithms to mine data stream have been extensively studied in recent years. However, a lot of these algorithms are designed for supervised learning which requires labeled instances. Nevertheless, the labeling of the data is costly and time-consuming. Because of this, alternative learning paradigms have been proposed to reduce the cost of the labeling process without significant loss of model performance. Active learning is one of these paradigms, whose main objective is to build classification models that request the lowest possible number of labeled examples achieving adequate levels of accuracy. Therefore, this work presents the FASE-AL algorithm which induces classification models with non-labeled instances using Active Learning. FASE-AL is based on the algorithm Fast Adaptive Stacking of Ensembles (FASE). FASE is an ensemble algorithm that detects and adapts the model when the input data stream has concept drift. FASE-AL was compared with four different strategies of active learning found in the literature. Real and synthetic databases were used in the experiments. The algorithm achieves promising results in terms of the percentage of correctly classified instances.