首页> 外文会议>International Conference on Informatics and Computing >Choosing The Most Optimum Text Preprocessing Method for Sentiment Analysis: Case:iPhone Tweets
【24h】

Choosing The Most Optimum Text Preprocessing Method for Sentiment Analysis: Case:iPhone Tweets

机译:选择最佳的情绪分析的文本预处理方法:案例:iPhone推文

获取原文

摘要

Preprocessing is the initial stage in several text processing tasks, including sentiment analysis. Preprocessing is an important step in sentiment analysis because it could affect the result accuracy significantly. However, previous studies on preprocessing that discussed the selection of preprocessing methods were rarely conducted. In this study, we analyze the effect of preprocessing methods on sentiment analysis task. We performed the sentiment analysis as a classification on product opinion, whether the sentiment is positive or negative. We conducted an experiment using Tweets that talk about iPhone. We observed seven different preprocessing methods and the combination of it. The preprocessing methods are: casefolding, expressive lengthening, emoticons handling, removing URLs, slang handling, punctuations handling, stopwords removal and stemming. The results show that a combination of five methods: URL removal, emoticon handling, case folding, expressive lengthening and stemming is the most optimum method with an accuracy of 70.88% on sentiment analysis.
机译:预处理是几个文本处理任务中的初始阶段,包括情感分析。预处理是情感分析的重要步骤,因为它可能会显着影响结果精度。然而,很少进行先前关于预处理的预处理的研究很少进行。在这项研究中,我们分析了预处理方法对情意分析任务的影响。我们对产品意见的分类进行了情感分析,情绪是否正为正或负面。我们使用谈论iPhone的推文进行了一个实验。我们观察了七种不同的预处理方法和它的组合。预处理方法是:案例,表现力延长,表情符号处理,消除URL,俚语处理,点击处理,止动件移除和置换。结果表明,五种方法的组合:URL去除,表情符号处理,案例折叠,表达延长和茎是最优化的方法,精度为70.88%的情绪分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号