首页> 外文学位 >Review spam and reviewer behavior analysis.
【24h】

Review spam and reviewer behavior analysis.

机译:查看垃圾邮件和审阅者的行为分析。

获取原文
获取原文并翻译 | 示例

摘要

In recent years, opinion retrieval and mining attracted a lot of attention in the research community due to a wide range of applications. An important issue that has been largely ignored is the review spam, which refers to writing fake reviews to promote or to damage the reputations of some objects. Detecting such fake reviews and reviewers is critical for effective opinion mining and opinion search. In first part of this work we show different classification techniques used to detect various types of review spam with some degree of success. However, review spam appears to be harder to detect because it is very hard, if not impossible, to recognize fake reviews by manually reading them. So, in the second part of this work, we deal with a restricted problem, i.e. identifying unusual review patterns which can represent suspicious behaviors of reviewers, e.g., a reviewer who wrote all negative reviews on many products of a brand when other reviewers are generally positive about the brand is naturally suspicious. We believe that these patterns can provide useful signals for spam detection. We formulate the problem as finding unexpected rules and rule groups by borrowing some ideas from data mining. To find unexpected rules, one needs to know what is expected. A natural approach is proposed, which requires no user input. It defines expectations based on the inherent distribution of the data. Based on these expectations, metrics are designed to measure unexpectedness of rules and rule groups. The technique is domain and application independent. Using the technique, we analyzed an Amazon.com review dataset and found many unexpected rules and rule groups which indicate spam activities. To demonstrate the domain/application independent nature of the proposed method, we also analyzed a set of tweets from Twitter.com and found many pieces of interesting information.
机译:近年来,由于广泛的应用,意见检索和挖掘在研究界引起了很多关注。很大程度上被忽略的重要问题是评论垃圾邮件,它是指撰写虚假评论以促进或破坏某些对象的声誉。检测到这些虚假的评论和评论者对于有效的观点挖掘和观点搜索至关重要。在这项工作的第一部分中,我们展示了用于检测各种类型的垃圾评论垃圾邮件的不同分类技术,并取得了一定程度的成功。但是,评论垃圾邮件似乎更难发现,因为很难(即使不是不可能)通过手动阅读来识别虚假评论。因此,在本工作的第二部分中,我们处理一个受限制的问题,即确定异常的评论模式,这些模式可能代表评论者的可疑行为,例如,当其他评论者通常都在品牌的许多产品上发表负面评论时,评论者对品牌的正面评价自然是可疑的。我们认为,这些模式可以为垃圾邮件检测提供有用的信号。我们通过从数据挖掘中借鉴一些想法,将问题表述为寻找意外的规则和规则组。为了找到意想不到的规则,人们需要知道所期望的。提出了一种自然的方法,不需要用户输入。它基于数据的固有分布来定义期望。基于这些期望,度量标准旨在测量规则和规则组的意外情况。该技术是域和应用程序无关的。使用该技术,我们分析了Amazon.com评论数据集,发现了许多表明垃圾邮件活动的意外规则和规则组。为了演示所提出方法的域/应用程序独立性,我们还分析了Twitter.com上的一组推文,并发现了许多有趣的信息。

著录项

  • 作者

    Jindal, Nitin.;

  • 作者单位

    University of Illinois at Chicago.;

  • 授予单位 University of Illinois at Chicago.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 142 p.
  • 总页数 142
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 遥感技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号