Journal: Computer Engineering and Design (《计算机工程与设计》)

An Improved TrustRank Algorithm Incorporating Content Features

         

Abstract

Driven by profit, spam pages use deceptive techniques to trick search engines into ranking them higher, which interferes with users' access to information. Detecting web spam is one of the major challenges search engines face. Based on an analysis of web page content features and their distributions, this paper proposes a method that combines content feature information with the TrustRank algorithm to detect spam pages. Experimental results show that TrustRank, when combined with content feature information, detects spam pages effectively.
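The abstract does not say how the content features enter TrustRank, so the following is only a minimal sketch under stated assumptions, not the paper's method. It runs the standard TrustRank propagation (a PageRank biased toward a trusted seed set) and then discounts each page's trust by a content-based spam likelihood. The names `trustrank`, `combine_with_content`, and `spam_likelihood`, the multiplicative discount, and the toy graph are all hypothetical choices made for illustration.

```python
def trustrank(out_links, trusted_seeds, alpha=0.85, iterations=50):
    """Standard TrustRank: trust starts at the seeds and flows along out-links.

    out_links maps every page to the list of pages it links to; every
    link target must also appear as a key.
    """
    pages = list(out_links)
    # Seed distribution: uniform over the trusted pages, zero elsewhere.
    seed = {p: (1.0 / len(trusted_seeds) if p in trusted_seeds else 0.0)
            for p in pages}
    trust = dict(seed)
    for _ in range(iterations):
        # Each round re-injects (1 - alpha) of the seed distribution.
        nxt = {p: (1.0 - alpha) * seed[p] for p in pages}
        for p in pages:
            targets = out_links[p]
            if not targets:
                continue  # dangling page: its trust is not propagated
            share = alpha * trust[p] / len(targets)
            for q in targets:
                nxt[q] += share
        trust = nxt
    return trust


def combine_with_content(trust, spam_likelihood):
    """ASSUMPTION (not from the paper): scale link-based trust by
    (1 - content-based spam likelihood), with likelihood in [0, 1]."""
    return {p: trust[p] * (1.0 - spam_likelihood.get(p, 0.0)) for p in trust}


# Toy graph: "a" and "b" are good pages; "x" is a spam page that "b"
# links to (for example through a spammed comment), so "x" inherits trust.
out_links = {"a": ["b"], "b": ["a", "x"], "x": []}
trust = trustrank(out_links, trusted_seeds={"a"})

# Hypothetical output of a content classifier: "x" looks like spam.
spam_likelihood = {"x": 0.9}
combined = combine_with_content(trust, spam_likelihood)
```

In this toy graph the link structure alone hands some trust to `x`, because a good page links to it. The content-based discount is one simple way that content features could correct for that kind of leaked trust.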
