论文标题

有效的电子邮件垃圾邮件检测系统使用极端梯度提升

Effective Email Spam Detection System using Extreme Gradient Boosting

论文作者

Mustapha, Ismail B., Hasan, Shafaatunnur, Olatunji, Sunday O., Shamsuddin, Siti Mariyam, Kazeem, Afolabi

论文摘要

电子邮件向电子设备用户提供的电子邮件提供的知名度,成本效益和信息交换的易用性困扰着不断上升或垃圾邮件的电子邮件数量增加。由于需要保护电子邮件用户免受这种日益严重的威胁的需求,在过去的十年中,垃圾邮件过滤/检测系统的研究越来越活跃。但是,垃圾邮件电子邮件的适应性通常使大多数此类系统无效。虽然文献中已经报道了几种垃圾邮件检测模型,但在样本测试数据中报告的性能显示了更多改进的空间。在这项研究中提出的是一种基于极端梯度提升(XGBOOST)的改进的垃圾邮件检测模型,据我们所知,这几乎没有引起关注垃圾邮件的电子邮件检测问题。实验结果表明,所提出的模型在广泛的评估指标上的表现优于早期方法。还提供了与早期作品结果相比的模型结果的详尽分析。

The popularity, cost-effectiveness and ease of information exchange that electronic mails offer to electronic device users has been plagued with the rising number of unsolicited or spam emails. Driven by the need to protect email users from this growing menace, research in spam email filtering/detection systems has being increasingly active in the last decade. However, the adaptive nature of spam emails has often rendered most of these systems ineffective. While several spam detection models have been reported in literature, the reported performance on an out of sample test data shows the room for more improvement. Presented in this research is an improved spam detection model based on Extreme Gradient Boosting (XGBoost) which to the best of our knowledge has received little attention spam email detection problems. Experimental results show that the proposed model outperforms earlier approaches across a wide range of evaluation metrics. A thorough analysis of the model results in comparison to the results of earlier works is also presented.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源