首页> 外文会议>2016 International Seminar on Application for Technology of Information and Communication >Evaluation of classification methods for Indonesian text emotion detection
【24h】

Evaluation of classification methods for Indonesian text emotion detection

机译:印尼文字情感检测分类方法的评价

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper presents Indonesian text emotion detection and evaluates the performances of four different classification methods: Naive Bayes (NB), J48, K-Nearest Neighbor (KNN) and Support Vector Machine-Sequential Minimal Optimization (SVM-SMO). The experiment uses Indonesian text corpus, containing 1000 sentences which consists of six emotion classes: anger, disgust, fear, joy, sadness, and surprise. Preprocessing step which consists of tokenization, case normalization, stopword removal, stemming and TFIDF are used to extract the features of text emotion. We conduct 10-fold cross validation and split validation for the experiment. Based on the result, we conclude that SVM-SMO classifier gives the best performance. In the 10-fold cross validation, the result shows that the accuracy of NB, J48, KNN and SVM-SMO are 80.2%, 80.8%, 68.1%, and 85.5% respectively. The same conclusion is also demonstrated by the split validation, the highest accuracy of 86% is also achieved by SVM-SMO.
机译:本文介绍了印尼文字情感检测,并评估了四种不同的分类方法的性能:朴素贝叶斯(NB),J48,K最近邻(KNN)和支持向量机顺序最小优化(SVM-SMO)。实验使用印度尼西亚语文本语料库,其中包含1000个句子,其中包括六个情感类别:愤怒,厌恶,恐惧,喜悦,悲伤和惊奇。预处理步骤由标记化,大小写规范化,停用词去除,词干和TFIDF组成,用于提取文本情感特征。我们对实验进行10倍交叉验证和拆分验证。根据结果​​,我们得出结论,SVM-SMO分类器可提供最佳性能。在10倍交叉验证中,结果表明NB,J48,KNN和SVM-SMO的准确性分别为80.2%,80.8%,68.1%和85.5%。分割验证也证明了相同的结论,SVM-SMO也达到了86%的最高准确度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号