首页> 外文期刊>The Electronic Library >Analysing user sentiment of Indian movie reviews: A probabilistic committee selection model
【24h】

Analysing user sentiment of Indian movie reviews: A probabilistic committee selection model

机译:分析印度电影评论的用户情绪:概率委员会选择模型

获取原文
获取原文并翻译 | 示例
       

摘要

Purpose - To be sustainable and competitive in the current business environment, it is useful to understand users' sentiment towards products and services. This critical task can be achieved via natural language processing and machine learning classifiers. This paper aims to propose a novel probabilistic committee selection classifier (PCC) to analyse and classify the sentiment polarities of movie reviews. Design/methodology/approach - An Indian movie review corpus is assembled for this study. Another publicly available movie review polarity corpus is also involved with regard to validating the results. The greedy stepwise search method is used to extract the features/words of the reviews. The performance of the proposed classifier is measured using different metrics, such as F-measure, false positive rate, receiver operating characteristic (ROC) curve and training time. Further, the proposed classifier is compared with other popular machine-learning classifiers, such as Bayesian, Naïve Bayes, Decision Tree (J48), Support Vector Machine and Random Forest. Findings - The results of this study show that the proposed classifier is good at predicting the positive or negative polarity of movie reviews. Its performance accuracy and the value of the ROC curve of the PCC is found to be the most suitable of all other classifiers tested in this study. This classifier is also found to be efficient at identifying positive sentiments of reviews, where it gives low false positive rates for both the Indian Movie Review and Review Polarity corpora used in this study. The training time of the proposed classifier is found to be slightly higher than that of Bayesian, Naive Bayes and J48. Research limitations/implications - Only movie review sentiments written in English are considered. In addition, the proposed committee selection classifier is prepared only using the committee of probabilistic classifiers; however, other classifier committees can also be built, tested and compared with the present experiment scenario. Practical implications - In this paper, a novel probabilistic approach is proposed and used for classifying movie reviews, and is found to be highly effective in comparison with other state-of-the-art classifiers. This classifier may be tested for different applications and may provide new insights for developers and researchers. Social implications - The proposed PCC may be used to classify different product reviews, and hence may be beneficial to organizations to justify users' reviews about specific products or services. By using authentic positive and negative sentiments of users, the credibility of the specific product, service or event may be enhanced. PCC may also be applied to other applications, such as spam detection, blog mining, news mining and various other data-mining applications. Originality/value - The constructed PCC is novel and was tested on Indian movie review data.
机译:目的-为了在当前的商业环境中保持可持续性和竞争力,了解用户对产品和服务的看法非常有用。这项关键任务可以通过自然语言处理和机器学习分类器来完成。本文旨在提出一种新颖的概率委员会选择分类器(PCC),以对电影评论的情感极性进行分析和分类。设计/方法/方法-为这项研究而组建的印度电影评论语料库。关于验证结果,还涉及另一个可公开获得的电影评论极性语料库。贪婪的逐步搜索方法用于提取评论的特征/单词。拟议的分类器的性能是使用不同的度量标准来衡量的,例如F度量,误报率,接收器工作特性(ROC)曲线和训练时间。此外,将提议的分类器与其他流行的机器学习分类器(例如贝叶斯,朴素贝叶斯,决策树(J48),支持向量机和随机森林)进行比较。调查结果-这项研究的结果表明,提出的分类器擅长预测电影评论的正面或负面。它的性能准确性和PCC的ROC曲线的值被认为是本研究中测试的所有其他分类器中最合适的。该分类器还可以有效地识别评论的积极情绪,在此研究中使用的“印度电影评论”和“评论极性”语料库的假阳性率低。发现提出的分类器的训练时间比贝叶斯,朴素贝叶斯和J48的训练时间略长。研究局限/含义-仅考虑用英语撰写的电影评论情感。此外,仅使用概率分类器委员会来准备拟议的委员会选择分类器。但是,也可以建立,测试其他分类器委员会,并将其与当前实验方案进行比较。实际意义-在本文中,提出了一种新颖的概率方法并将其用于对电影评论进行分类,并且与其他最新分类器相比,该方法是非常有效的。该分类器可以针对不同的应用程序进行测试,并且可以为开发人员和研究人员提供新的见解。社会影响-提议的PCC可用于对不同的产品评论进行分类,因此可能对组织证明用户对特定产品或服务的评论合理的组织有益。通过使用真实的用户正面和负面情绪,可以提高特定产品,服务或事件的可信度。 PCC还可以应用于其他应用程序,例如垃圾邮件检测,博客挖掘,新闻挖掘和各种其他数据挖掘应用程序。原创性/价值-所构建的PCC是新颖的,并已在印度电影评论数据上进行了测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号