首页> 外文会议>International Conference on Artificial Intelligence and Data Processing >Spam Mail Detection using Naive Bayes method with Apache Spark
【24h】

Spam Mail Detection using Naive Bayes method with Apache Spark

机译:使用朴素贝叶斯方法和Apache Spark进行垃圾邮件检测

获取原文

摘要

Significant progress has been made in internet technologies with great progress in information infrastructure and in parallel, the amount of data produced has reached incredible dimensions. Nowadays, storage and processing of this data is the most important big data problem. In recent years new technologies have been developed in this study area. The Apache Spark project is considered one of the most important of these Technologies. In this study, a classification application was devoloped on Apache Spark using the Naive Bayes method which machine learning libraries of Apache Spark. A data set including of mails labeled as Spam and Not Spam was analyzed using Apache Spark and a classification application with a high accuracy ratio was performed. The performance of Apache Spark is quite different compared to other platforms that are most used in data analysis.
机译:互联网技术已取得重大进展,信息基础架构也取得了巨大进步,与此同时,生成的数据量达到了令人难以置信的规模。如今,此数据的存储和处理是最重要的大数据问题。近年来,在该研究领域中已经开发了新技术。 Apache Spark项目被认为是这些技术中最重要的项目之一。在这项研究中,使用Naive Bayes方法在Apache Spark上开发了分类应用程序,该方法是Apache Spark的机器学习库。使用Apache Spark分析了包含标记为垃圾邮件和非垃圾邮件的邮件的数据集,并执行了具有高准确率的分类应用程序。与数据分析中最常用的其他平台相比,Apache Spark的性能有很大不同。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号