首页> 外文会议>International Conference on Information Management >Identification of fake reviews using semantic and behavioral features
【24h】

Identification of fake reviews using semantic and behavioral features

机译:使用语义和行为特征识别假评论

获取原文

摘要

In recent years, online reviews have been playing an important role in making purchase decisions. This is because, these reviews can provide customers with large amounts of useful information about the goods or service. However, to promote factitiously or lower the quality of the products or services, spammers may forge and produce fake reviews. Due to such behavior of the spammers, customers would be misleaded and make wrong decisions. Thus detecting fake (spam) reviews is a significant problem. In this paper, we propose two types of features and apply supervised machine learning algorithms for performing classification on Yelp's real-life data. In terms of features used, there are two new semantic feature sets: readability features and topic features. Our results show that our proposed new features are more effective than n-gram features in detecting spam reviews. To improve classification on the real Yelp review data, we use a set of behavioral features about reviewers and their reviews for learning, which dramatically improves the classification result on real-life opinion spam data. For further improvement, we also ensure the number of reviewers instead of reviews is balanced.
机译:近年来,在线评论在制定购买决策中一直发挥着重要作用。这是因为,这些评论可以为客户提供有关商品或服务的大量有用信息。但是,为了人为地宣传或降低产品或服务的质量,垃圾邮件发送者可能伪造并产生虚假评论。由于垃圾邮件发送者的这种行为,客户将被误导并做出错误的决定。因此,检测虚假(垃圾邮件)评论是一个重大问题。在本文中,我们提出了两种类型的功能,并应用了监督机器学习算法对Yelp的真实数据进行分类。就使用的功能而言,有两个新的语义功能集:可读性功能和主题功能。我们的结果表明,我们提出的新功能在检测垃圾邮件评论方面比n-gram功能更有效。为了改善对Yelp真实评论数据的分类,我们使用了一组有关评论者及其评论的学习行为特征,从而极大地改善了对现实生活中的垃圾评论数据的分类结果。为了进一步改进,我们还确保平衡审阅者而不是审阅者的数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号