首页> 外文会议>Multi-disciplinary International Conference on Artificial Intelligence >Comparing of Multi-class Text Classification Methods for Automatic Ratings of Consumer Reviews
【24h】

Comparing of Multi-class Text Classification Methods for Automatic Ratings of Consumer Reviews

机译:多级文本分类方法对消费者评论自动评级的多级文本分类方法

获取原文

摘要

Consumer reviews show inconsistent ratings when compared to their contents as a result of sarcastic feedback. Consequently, they cannot provide valuable feedback to improve products and services of the firms. One possible solution is to utilize consumer review contents to identify the true ratings. In this work, different multi-class classification methods were applied to assign automatic ratings for consumer reviews based on a 5-star rating scale, where the original review ratings were inconsistent with the content. Two term weighting schemes (i.e. tf-idf and tf-igm) and five supervised machine learning algorithms (i.e. κ-NN, MNB, RF, XGBoost and SVM) were compared. The dataset was downloaded from the Amazon website, and language experts helped to correct the real rating for each consumer review. After verifying the effectiveness of the proposed methods, the multi-class classifier model developed by SVM along with tf-igm returned the best results for automatic ratings of consumer reviews, with average improved scores of accuracies and F1 over the other methods at 11.7% and 10.5%, respectively.
机译:与讽刺的反馈相比,消费者评论显示与其内容相比不一致的评级。因此,他们无法提供有价值的反馈,以改善公司的产品和服务。一个可能的解决方案是利用消费者审查内容来识别真正的评级。在这项工作中,应用了不同的多级分类方法,以根据五星级评级规模为消费者评论分配自动评级,原始审查评级与内容不一致。比较了两个术语加权方案(即TF-IDF和TF-IgM)和五种监督机器学习算法(即κ-NN,MNB,RF,XGBoost和SVM)。数据集从亚马逊网站下载,语言专家有助于纠正每个消费者审查的实际评级。在验证所提出的方法的有效性之后,SVM开发的多级分类器模型以及TF-IGM返回了消费者评论的自动评级的最佳效果,在11.7%的其他方法中,平均改善了精度和F1的分数。分别为10.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号