...
首页> 外文期刊>Journal of computational and theoretical nanoscience >Applying Relief Algorithm for Feature Selection in Sentiment Classification for Movie Reviews
【24h】

Applying Relief Algorithm for Feature Selection in Sentiment Classification for Movie Reviews

机译:应用救济算法在电影评论中为情商分类中的特征选择

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper mainly focuses on using the Relief technique to improve the performance of sentiment classification (specifically the Na?ve Bayes classifier). We also present a way to find a suitable feature selection technique that can support the Na?ve Bayes classifier. We willcompare the Relief Algorithm with other feature selection methods that are currently considered innovative and which are featured in the latest research papers, namely the Information-Gain and Chi-Square techniques. In addition, we will present the calculation processes of the feature weightsof the data in order to compare the effectiveness of each feature selection method. The most commonly accepted approach to the sentiment classification problem is to first start by extracting and then selecting text features. Then the step in the feature selection process is to pick out thekeyword which is then to be used in the main preprocessing part that is critical to the indexing of documents. From the results of our study, it can be seen that utilizing the Na?ve Bayes Classifier without feature selection results in a relatively lower score. For the large movie reviewdataset, without feature selection we get a result of 78.80%. However, when the relief method is applied, the score is improved to 80.30%. In fact, when using the Na?ve Bayes Classifier with the Chi-Square and Information Gain methods separately, the score returned is slightly higheras well, giving off a score of 80.40%. In the future, improvements to these methods could be made but along with the improvements in our technology, we may witness more efficient methods to sentiment analysis due to either more efficient algorithms or a more rapid execution of algorithms dueto improved computational executions. Eventually, the goal would be to surpass the 80% region and to eventually break into at least the 90% region of accuracy.
机译:本文主要侧重于采用浮雕技术来提高情绪分类的性能(特别是Na'Ve Bayes分类器)。我们还提供了一种方法来找到一个适当的特征选择技术,可以支持Na ve Bayes分类器。我们将使用当前认为具有创新性的其他特征选择方法进行救援算法,并在最新的研究论文中具有特色,即信息增益和Chi-Square技术。另外,我们将介绍数据的特征权重的计算过程,以便比较每个特征选择方法的有效性。最常见的情绪分类问题的方法是首先通过提取然后选择文本特征。然后,要素选择过程中的步骤是挑出键的Keyword,然后在主要预处理部分中使用,这对于文档的索引至关重要。从我们的研究结果来看,可以看出利用Na ve贝雷斯分类器而没有特征选择导致得分相对较低。对于大型电影readyDataset,没有功能选择,我们得到了78.80%的结果。但是,当应用浮雕方法时,分数提高到80.30%。实际上,当使用Na ve Bayes分类器与Chi-Square和信息分别获得方法时,返回的分数略高于略高,发出80.40%的分数。将来,可以提高这些方法的改进,但随着我们技术的改进,我们可以引用由于更有效的算法或更快速地执行算法Dueto改进的计算执行而导致的情绪分析更有效。最终,目标是超越80%的地区,最终将至少分成90%的准确性区域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号