论文标题
模糊粗糙理论中规则归纳的加速器
An Accelerator for Rule Induction in Fuzzy Rough Theory
论文作者
论文摘要
基于规则的分类器,这些分类器提取了一部分引起的规则,以有效地学习/矿山,同时保留可见性信息,在人工智能中起着至关重要的作用。但是,在这个大数据时代,整个数据集的规则归纳在计算上是密集的。到目前为止,据我们所知,尚无重点关注加速规则诱导的已知方法。这是第一个考虑加速技术以减少规则归纳中计算规模的研究。我们提出了一个基于模糊粗糙理论的规则归纳的加速器。加速器可以避免冗余计算并加速规则分类器的构建。首先,提出了基于一致性的规则诱导方法,称为基于一致性的价值降低(CVR),并用作加速的基础。其次,我们引入了一个压实的搜索空间,称为密钥集,其中仅包含更新诱导规则所需的关键实例,以进行降低价值。密钥集的单调性确保了我们的加速器的可行性。第三,规则诱导加速器的设计基于密钥集,并且从理论上保证显示与未加速版本相同的结果。具体而言,密钥集的等级保存属性确保了加速器实现的规则归纳与未加速方法之间的一致性。最后,广泛的实验表明,所提出的加速器的执行速度比未加密的基于规则的分类器方法更快,尤其是在具有大量实例的数据集上。
Rule-based classifier, that extract a subset of induced rules to efficiently learn/mine while preserving the discernibility information, plays a crucial role in human-explainable artificial intelligence. However, in this era of big data, rule induction on the whole datasets is computationally intensive. So far, to the best of our knowledge, no known method focusing on accelerating rule induction has been reported. This is first study to consider the acceleration technique to reduce the scale of computation in rule induction. We propose an accelerator for rule induction based on fuzzy rough theory; the accelerator can avoid redundant computation and accelerate the building of a rule classifier. First, a rule induction method based on consistence degree, called Consistence-based Value Reduction (CVR), is proposed and used as basis to accelerate. Second, we introduce a compacted search space termed Key Set, which only contains the key instances required to update the induced rule, to conduct value reduction. The monotonicity of Key Set ensures the feasibility of our accelerator. Third, a rule-induction accelerator is designed based on Key Set, and it is theoretically guaranteed to display the same results as the unaccelerated version. Specifically, the rank preservation property of Key Set ensures consistency between the rule induction achieved by the accelerator and the unaccelerated method. Finally, extensive experiments demonstrate that the proposed accelerator can perform remarkably faster than the unaccelerated rule-based classifier methods, especially on datasets with numerous instances.