首页> 外文期刊>Expert systems with applications >Using evidence based content trust model for spam detection
【24h】

Using evidence based content trust model for spam detection

机译:使用基于证据的内容信任模型进行垃圾邮件检测

获取原文
获取原文并翻译 | 示例
           

摘要

Content trust is one of the main components in the research of information retrieval. As it gets easier to add information to the Web via HTML pages, wikis, blogs, and other documents, it gets tougher to distinguish accurate or trustworthy information from inaccurate or untrustworthy information on the Web. Current technology of spam detection is based on binary metric, that is binary classification is adapted in the spam detection. In order to meet the users' need and preference, more accurate metric is needed in the content trust as well as in detecting spam information. In this paper, we use the notion of content trust for spam detection, and regard it as a ranking problem. Besides traditional text feature attributes, information quality based evidence is introduced to define the trust feature of spam information, and a novel content trust learning algorithm based on these evidence is proposed. Finally, a Web spam detection system is developed and the experiments on the real Web data are carried out, which show the proposed method performs very well in practice.
机译:内容信任是信息检索研究的主要组成部分之一。通过HTML页面,Wiki,博客和其他文档将信息添加到Web变得更加容易,将Web上准确或可信赖的信息与不准确或不可信赖的信息区分开变得更加困难。当前的垃圾邮件检测技术基于二进制度量,即在垃圾邮件检测中采用了二进制分类。为了满足用户的需求和偏爱,在内容信任以及检测垃圾邮件信息中需要更准确的度量。在本文中,我们将内容信任的概念用于垃圾邮件检测,并将其视为排名问题。除了传统的文本特征属性,还引入了基于信息质量的证据来定义垃圾邮件信息的信任特征,并提出了一种基于这些证据的新颖的内容信任学习算法。最后,开发了一个Web垃圾邮件检测系统,并在真实的Web数据上进行了实验,表明该方法在实践中表现良好。

著录项

  • 来源
    《Expert systems with applications》 |2010年第8期|p.5599-5606|共8页
  • 作者单位

    Department of Computer Science and Engineering, Tongji University, Shanghai 200092, China Tongji Branch National Engineering and Technology Center of High Performance, Shanghai 200092, China Key Laboratory of Embedded System and Service Computing, Ministry of Education, Shanghai 200092, China;

    Department of Computer Science and Engineering, Tongji University, Shanghai 200092, China Tongji Branch National Engineering and Technology Center of High Performance, Shanghai 200092, China Key Laboratory of Embedded System and Service Computing, Ministry of Education, Shanghai 200092, China;

    School of Economics and Management, Tongji University, Shanghai 200092, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    content trust; web spam; ranking; SVM; machine learning;

    机译:内容信任;网络垃圾邮件;排行;支持向量机;机器学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号