首页> 外文会议>Conference on Information Communication Technology and Society >A framework for sentiment analysis with opinion mining of hotel reviews
【24h】

A framework for sentiment analysis with opinion mining of hotel reviews

机译:带有酒店评论意见挖掘的情感分析框架

获取原文

摘要

The rapid increase in mountains of unstructured textual data accompanied by proliferation of tools to analyse them has opened up great opportunities and challenges for text mining research. The automatic labelling of text data is hard because people often express opinions in complex ways that are sometimes difficult to comprehend. The labelling process involves huge amount of efforts and mislabelled datasets usually lead to incorrect decisions. In this paper, we design a framework for sentiment analysis with opinion mining for the case of hotel customer feedback. Most available datasets of hotel reviews are not labelled which presents a lot of works for researchers as far as text data pre-processing task is concerned. Moreover, sentiment datasets are often highly domain sensitive and hard to create because sentiments are feelings such as emotions, attitudes and opinions that are commonly rife with idioms, onomatopoeias, homophones, phonemes, alliterations and acronyms. The proposed framework is termed sentiment polarity that automatically prepares a sentiment dataset for training and testing to extract unbiased opinions of hotel services from reviews. A comparative analysis was established with Naïve Bayes multinomial, sequential minimal optimization, compliment Naïve Bayes and Composite hypercubes on iterated random projections to discover a suitable machine learning algorithm for the classification component of the framework.
机译:大量非结构化文本数据的迅速增加以及分析工具的泛滥,为文本挖掘研究带来了巨大的机遇和挑战。文本数据的自动标记非常困难,因为人们常常以有时难以理解的复杂方式表达意见。标注过程涉及大量工作,标注错误的数据集通常会导致错误的决策。在本文中,我们针对酒店客户反馈的情况设计了一个使用观点挖掘进行情感分析的框架。关于文本数据预处理任务,大多数可用的酒店评论数据集都没有标记,这为研究人员提供了大量工作。此外,情感数据集通常是高度领域敏感的,并且很难创建,因为情感是诸如情感,态度和观点之类的感觉,通常带有成语,拟声词,同音词,音素,称谓和首字母缩写词。所提出的框架被称为情感极性,它自动准备用于训练和测试的情感数据集,以从评论中提取酒店服务的公正观点。在迭代随机投影上,采用朴素贝叶斯多项式,顺序最小优化,补充朴素贝叶斯和复合超立方体建立了比较分析,以发现适用于框架分类组件的机器学习算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号