首页> 外文期刊>Procedia Computer Science >An Ensemble Technique to Classify Multi-Class Textual Emotion
【24h】

An Ensemble Technique to Classify Multi-Class Textual Emotion

机译:一个分类多级文本情感的合奏技术

获取原文
获取外文期刊封面目录资料

摘要

Classifying textual emotion plays a critical role in several HCI applications where the text is utilized as a central means of communication such as messages, reviews, blogs and other Web 2.0 platforms. The extensive usage of the Internet has emerged as an unprecedented means for people to express their feelings or emotion on blogs, social media, and e-commerce sites in recent years. Most of the emotions displayed on the online platforms are in textual forms (such as posts, tweets, comments and reviews), which are unorganized and time-consuming to structured due to their disordered forms. Although several emotion analysis tools are available in high-resource languages, it is critical to developing an automatic emotion classification system for low-resource languages, including Bengali, due to its constrained resources. This paper presents an ML-based ensemble method to classify six primary textual emotions (anger, fear, disgust, sadness, surprise and joy) from Bengali texts. An emotion corpus containing 8047 Bengali texts is developed to perform the textual emotion classification task. This work investigates eight standard ML-based techniques such as logistic regression (LR), multinomial naive Bayes (MNB), support vector machine (SVM), random forest (RF), decision tree (DT), K-nearest neighbour (KNN) and adaptive boosting (AdaBoost) and an ensemble method (a combination of LR, RF, SVM) with Bag of words (BoW) and tf-idf feature extraction techniques. The experimental result demonstrates that the ensemble with tf-idf achieved the highest weightedf1-score of 62.39% compared to other methods.
机译:分类文本情绪在几种HCI应用程序中发挥着关键作用,文本用作媒体,媒体,诸如消息,评论,博客和其他Web 2.0平台等中央通信手段。互联网的广泛使用情况成为近年来博客,社交媒体和电子商务网站对人们表达自己的感受或情感的前所未有的手段。在线平台上显示的大多数情绪都是文本形式(例如帖子,推文,评论和评论),由于它们无序的形式而言,这是对结构的无组织和耗时。虽然有几种高资源语言提供了几种情感分析工具,但由于其约束资源,为开发用于低资源语言的自动情感分类系统至关重要。本文提出了一种基于ML的集合方法,将来自孟加拉文本的六个主要文本情绪(愤怒,恐惧,厌恶,悲伤,惊喜和喜悦)分类。开发了包含8047孟加拉语文本的情感语料库,以执行文本情感分类任务。本工作调查了八种基于ML的基础技术,如Logistic回归(LR),多项式幼稚贝叶斯(MNB),支持向量机(SVM),随机森林(RF),决策树(DT),K最近邻居(KNN)和自适应升压(Adaboost)和具有单词(弓)和TF-IDF特征提取技术的袋子(LR,RF,SVM)的集合方法(LR,RF,SVM组合)。实验结果表明,与其他方法相比,具有TF-IDF的集合达到了62.39%的最高重量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号