论文标题

准确检测网络钓鱼网站的智能方法

Intelligent Methods for Accurately Detecting Phishing Websites

论文作者

Abuzuraiq, Almaha, Alkasassbeh, Mouhammd, Almseidin, Mohammad

论文摘要

随着技术发展的增加,有大量的网站具有不同的目的。但是,在这个大型收藏中存在一种特定的类型,即旨在欺骗用户的所谓网站钓鱼网站。检测网络钓鱼网站的主要挑战是发现已使用的技术。在网络钓鱼者不断改善其策略并创建可以保护自己免受多种形式检测方法的网页的地方。因此,非常有必要开发可靠,活跃和当代的网络钓鱼检测方法来对抗phishers使用的自适应技术。在本文中,通过将它们分为三个主要组来审查不同的网络钓鱼检测方法。然后,提出的模型分为两个阶段。在第一阶段,使用不同的机器学习算法来验证所选数据集并在其上应用功能选择方法。因此,通过仅利用与随机森林相结合的48个功能中的20个功能仅利用20个功能来实现最佳精度。在第二阶段,同一数据集应用于各种模糊逻辑算法。以及模糊逻辑算法应用的实验结果令人难以置信。在仅使用五个特征的Furia算法的情况下,准确率为99.98%。最后,对应用机器学习算法和模糊逻辑算法之间的结果进行了比较和讨论。使用模糊逻辑算法的性能超过了机器学习算法的使用。

With increasing technology developments, there is a massive number of websites with varying purposes. But a particular type exists within this large collection, the so-called phishing sites which aim to deceive their users. The main challenge in detecting phishing websites is discovering the techniques that have been used. Where phishers are continually improving their strategies and creating web pages that can protect themselves against many forms of detection methods. Therefore, it is very necessary to develop reliable, active and contemporary methods of phishing detection to combat the adaptive techniques used by phishers. In this paper, different phishing detection approaches are reviewed by classifying them into three main groups. Then, the proposed model is presented in two stages. In the first stage, different machine learning algorithms are applied to validate the chosen dataset and applying features selection methods on it. Thus, the best accuracy was achieved by utilizing only 20 features out of 48 features combined with Random Forest is 98.11%. While in the second stage, the same dataset is applied to various fuzzy logic algorithms. As well the experimental results from the application of Fuzzy logic algorithms were incredible. Where in applying the FURIA algorithm with only five features the accuracy rate was 99.98%. Finally, comparison and discussion of the results between applying machine learning algorithms and fuzzy logic algorithms is done. Where the performance of using fuzzy logic algorithms exceeds the use of machine learning algorithms.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源