首页> 外文会议>IEEE Symposium Series on Computational Intelligence >Dimensionality Reduction for Sentiment Classification using Machine Learning Classifiers
【24h】

Dimensionality Reduction for Sentiment Classification using Machine Learning Classifiers

机译:使用机器学习分类器进行情感分类的降维

获取原文

摘要

Sentiment analysis intends to identify the opinion either positive or negative given by clients or users from review documents. Sentiment analysis utilizing machine learning strategies faces the issue of high dimensionality of the feature vector. Consequently, a feature reduction strategy is required to dispose of the unessential and noisy elements from the feature vector. Feature reduction techniques selects the prominent features for reducing size of the feature set. The features which are nearly distributed presented by different class in the feature vector, make complexity for the classifier to draw a clear decision boundary. In this work, we proposed two different approaches (i.e., Term Presence Count (TPC) and Term Presence Ratio (TPR)) to remove those redundant features in positively and negatively tagged documents with nearly equal distribution. We applied four machine learning-based classification techniques including Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and Naïve Bayes (NB) for sentiment classification using movie review dataset. Finally, the classifiers are evaluated in terms of accuracy, precision, recall, and Average F-measure. Experimental results manifest that the feature dimension reduced to approximately 83% by our proposed method while improving the classification performance.
机译:情绪分析旨在从审查文档中识别出客户或用户给出的正面或负面意见。利用机器学习策略的情感分析面临着特征向量的高维性问题。因此,需要一种特征缩减策略来处理特征向量中不必要和嘈杂的元素。特征缩减技术选择突出的特征以减小特征集的大小。特征向量中不同类别呈现的几乎分布的特征使分类器绘制清晰的决策边界变得很复杂。在这项工作中,我们提出了两种不同的方法(即术语存在计数(TPC)和术语存在比率(TPR)),以消除具有几乎相等分布的正负标签文档中的那些冗余特征。我们应用了四种基于机器学习的分类技术,包括Logistic回归(LR),支持向量机(SVM),随机森林(RF)和朴素贝叶斯(NB),用于使用电影评论数据集进行情感分类。最后,根据准确性,准确性,召回率和平均F值对分类器进行评估。实验结果表明,在改进分类性能的同时,我们提出的方法将特征尺寸降低到了约83%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号