首页> 外文会议>International Conference on Information and Communication Technology >On The Feature Extraction For Sentiment Analysis of Movie Reviews Based on SVM
【24h】

On The Feature Extraction For Sentiment Analysis of Movie Reviews Based on SVM

机译:基于SVM的电影评论情感分析特征提取

获取原文

摘要

Watching a movie is one of the activities that reduce bored, so it is necessary to look for information about the movie, which is packaged in the form of a movie review to determine whether the movie considered for viewing or no. However, in searching for information through movie reviews, there are obstacles because there are many reviews conducted by reviewers. Therefore, sentiment analysis is needed aims to classify the movie review into positive and negative sentiments. Machine learning methods can use as a sentiment analysis classification because that can produce the best performance, the method called Support Vector Machine (SVM). That was a reason SVM classification used in sentiment analysis on movie review data. Use feature extraction of Term Frequency- Inverse Document Frequency (TF-IDF) was also carried out in the research this as a method of weighting words which then combined with the extraction of Latent features Dirichlet Allocation (LDA) as a method of modeling topics that can overcome the shortcomings of SVM. This research produced the best performance on a combination of TF-IDF and LDA, with 240 topics has 29792 features, which is 82.16%.
机译:观看电影是减少乏味的一项活动,因此有必要查找有关该电影的信息,这些信息以电影评论的形式打包在一起,以确定是否考虑观看该电影。但是,在通过电影评论搜索信息时,会遇到障碍,因为评论者进行的评论很多。因此,需要进行情感分析以将电影评论分为积极情感和消极情感。机器学习方法可以用作情感分析分类,因为它可以产生最佳性能,该方法称为支持向量机(SVM)。这就是SVM分类用于电影评论数据的情感分析的原因。在这项研究中,还进行了术语频率逆文档频率(TF-IDF)的使用特征提取,作为加权单词的方法,然后与潜在特征Dirichlet分配(LDA)的提取相结合,作为对主题进行建模的方法,可以克服SVM的缺点。这项研究在TF-IDF和LDA的组合上产生了最佳性能,其中240个主题具有29792个特征,占82.16%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号