首页> 外文会议>International Conference on Soft Computing for Problem Solving >Spammer Classification Using Ensemble Methods over Content-Based Features
【24h】

Spammer Classification Using Ensemble Methods over Content-Based Features

机译:垃圾邮件发送者使用集合方法进行基于内容的功能分类

获取原文
获取外文期刊封面目录资料

摘要

As the web documents are raising at high scale, it is very difficult to access useful information. Search engines play a major role in retrieval of relevant information and knowledge. They deal with managing large amount of information with efficient page ranking algorithms. Still web spammers try to intrude the search engine results by various web spamming techniques for their personal benefit. According to the recent report from Internetlivestats in March (2016), an Internet survey company, states that there are currently 3.4 billion Internet users in the world. From this survey it can be judged that the search engines play a vital role in retrieval of information. In this research, we have investigated fifteen different machine learning classification algorithms over content based features to classify the spam and non spam web pages. Ensemble approach is done by using three algorithms which are computed as best on the basis of various parameters. Ten Fold Cross-validation approach is also used.
机译:随着Web文档的高度升高,很难访问有用的信息。搜索引擎在检索相关信息和知识中发挥着重要作用。他们处理使用有效的页面排列算法管理大量信息。仍然Web垃圾邮件发送者尝试通过各种Web垃圾邮件技术来侵入搜索引擎,以便他们的个人利益。根据InternetLivestats的最近报告(2016年3月),一个互联网调查公司,国家是世界上有34亿互联网用户。根据本调查,可以判断搜索引擎在检索信息中发挥重要作用。在本研究中,我们研究了基于内容的功能的十五种不同的机器学习分类算法,以分类垃圾邮件和非垃圾邮件网页。通过使用三种算法基于各种参数使用三种算法来完成集合方法。还使用十个倍数交叉验证方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号