首页> 中文期刊>郑州大学学报(理学版) >一种应用于博客的垃圾评论识别方法

一种应用于博客的垃圾评论识别方法

     

摘要

A new method to identify blog comments spam was proposed. The short comments were identified by the network common words first, and made K rounds to identify the comments which used the improved similarity formula. Following every identifies, the weights of keywords and extend keywords were adjusted. All the comments were identified to the category. The spam reviews were filter again by the network common words and the keywords, and more legitimate comments were identified. Experimental results showed that the method, to some extent, improved the recognition accuracy.%针对博客垃圾评论泛滥的问题,提出了一种识别博客垃圾评论的新方法.利用网络常用语对短小评论先进行评论的识别,然后利用改进的相似度公式对评论进行了K轮评论的识别,在每轮识别之后,对主题词进行权重的调整和主题词扩展;待所有评论识别完毕,再利用网络常用语和主题词对识别出的垃圾评论进行第二次过滤,过滤出垃圾评论中的合法评论.实验结果表明,利用该方法进行评论识别在一定程度上提高了识别垃圾评论的准确率和召回率.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号