【24h】

Study on feature selection and machine learning algorithms for Malay sentiment classification

机译:马来情绪分类的特征选择和机器学习算法研究

获取原文
获取原文并翻译 | 示例

摘要

Online social media is used to show the sentiments of different individuals about various subjects. Sentiment analysis or opinion mining has recently been considered as one of the highly dynamic research fields in natural language processing, Web mining, and machine learning. There has been a very limited amount of research that focuses on sentiment analysis in the Malay language. This study investigates how feature selection methods contribute to the improvement of Malay sentiment classification performance. Three supervised machine-learning classifiers and seven feature selection methods are used to conduct a series of experiments for the effective selection of the appropriate methods for the automatic sentiment classification of online Malay-written reviews. Findings show that the classifications of Malay sentiment improve using feature selections approaches. This work demonstrates that all feature reduction methods generally improve classifier performance. Support Vector Machine (SVM) approach provide the highest accuracy performance of features selection in order to classify Malay sentiment comparing with other classifications approaches such as PCA and CHI square. SVM records 87% as experimental accuracy result of feature selection.
机译:在线社交媒体用于显示不同个人对各种主题的看法。情感分析或观点挖掘最近被认为是自然语言处理,Web挖掘和机器学习中高度动态的研究领域之一。很少有研究专注于用马来语进行情感分析。这项研究调查了特征选择方法如何促进马来情绪分类性能的提高。使用三个监督的机器学习分类器和七个特征选择方法来进行一系列实验,以有效地选择适当的方法,以对在线马来文评论进行自动情感分类。研究结果表明,使用特征选择方法可以改善马来人情绪的分类。这项工作表明,所有特征约简方法通常都会提高分类器性能。与其他分类方法(例如PCA和CHI方)相比,支持向量机(SVM)方法提供了最准确的特征选择性能,以便对马来语情绪进行分类。 SVM记录了87%作为特征选择的实验精度结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号