首页> 外文会议>International Conference on Informatics and Computational Sciences >Hierarchical Sentence Sentiment Analysis Of Hotel Reviews Using The Naïve Bayes Classifier
【24h】

Hierarchical Sentence Sentiment Analysis Of Hotel Reviews Using The Naïve Bayes Classifier

机译:使用朴素贝叶斯分类器对酒店评论进行分层的句子情感分析

获取原文

摘要

Traveloka provides a space for its users to write reviews about their hotel reservation services. These reviews are very useful in informing hotel managers of the level of customer satisfaction. Sentiment analysis is a tool that can be used to analyse such reviews to determine whether they express opinions or not, so that the level of customer satisfaction can be measured based on the number of sentiments (positive or negative) contained in the opinions. In this research, the Naïve Bayes classifier was used to perform a hierarchical sentence sentiment analysis on hotel reviews obtained from Traveloka. In addition, two types of term weighting schemes were used for the feature extraction, namely, raw term frequency and TF-IDF. The results of this research indicated that it is better to use a hierarchical classification in sentiment analysis than a flat classification. The average F-measure value for the flat classification model was 75.18%, while for the hierarchical classification model it was 77.48%. These results showed that the use of a hierarchical classification in sentiment analysis improved the average performance of the classification model by 2.3%. The use of the raw term frequency feature extraction in a flat classification provided a higher F-measure value than the use of the TF-IDF feature extraction, with a margin of 3.9%. The average F-measure value for the flat classification using the raw term frequency feature extraction was 75.18%, while for the TF-IDF feature extraction it was 71.23%.
机译:Traveloka为用户提供了一个发表关于其酒店预订服务的评论的空间。这些评论对于告知酒店经理客户满意度水平非常有用。情感分析是一种工具,可用于分析此类评论以确定它们是否表达意见,以便可以基于意见中包含的情感数量(正面或负面)来衡量客户满意度。在这项研究中,朴素的贝叶斯分类器用于对从Traveloka获得的酒店评论进行分层的句子情感分析。另外,特征提取使用两种类型的项加权方案,即原始项频率和TF-IDF。这项研究的结果表明,在情感分析中使用分层分类比使用扁平分类更好。平面分类模型的平均F测量值为75.18%,而分层分类模型的平均F测量值为77.48%。这些结果表明,在情感分析中使用分层分类可将分类模型的平均性能提高2.3%。在平面分类中使用原始项频率特征提取可提供比使用TF-IDF特征提取更高的F测量值,裕度为3.9%。使用原始术语频率特征提取的平面分类的平均F测量值为75.18%,而对于TF-IDF特征提取的平均F测量值为71.23%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号