首页> 外文期刊>International Journal of Intelligent Systems and Applications >Emotion Detection of Tweets in Indonesian Language using Non-Negative Matrix Factorization
【24h】

Emotion Detection of Tweets in Indonesian Language using Non-Negative Matrix Factorization

机译:基于非负矩阵分解的印尼语推文情感检测

获取原文
       

摘要

Emotion detection is an application that is widely used in social media for industrial environment, health, and security problems. Twitter is ashort text messageknown as tweet. Based on content and purposes, the tweet can describes as information about a user’s emotion. Emotion detection by means oftweet, is a challenging problem because only a few features can be extracted. Getting features related to emotion is important at the first phase of extraction, so the appropriate features such as a hashtag, emoji, emoticon, and adjective terms are needed. We propose a new method for analyzing the linkages among features and reducedsemantically using Non-Negative Matrix Factorization (NMF). The dataset is taken from a Twitter application using Indonesian language with normalization of informal terms in advance. There are 764 tweets in corpus which have five emotions, i.e. happy (senang), angry (marah), fear (takut), sad (sedih), and surprise(terkejut). Then, the percentage of user’s emotion is computed by k-Nearest Neighbor(kNN) approach. Our proposed model achieves the problem of emotion detectionwhich is proved by the result near ground truth.
机译:情绪检测是一种在社交媒体中广泛用于解决工业环境,健康和安全问题的应用程序。 Twitter是一条短文本消息,称为tweet。根据内容和目的,该推文可以描述为有关用户情绪的信息。通过tweet进行情感检测是一个具有挑战性的问题,因为只能提取一些特征。在提取的第一阶段,获取与情感相关的功能很重要,因此需要适当的功能,例如主题标签,表情符号,表情符号和形容词。我们提出了一种新的方法来分析特征之间的联系,并使用非负矩阵分解(NMF)进行简化。该数据集来自使用印度尼西亚语的Twitter应用程序,并且预先对非正式用语进行了规范化。语料库中有764条推文,它们具有五种情感,即快乐(senang),愤怒(marah),恐惧(taku​​t),悲伤(sedih)和惊奇(terkejut)。然后,通过k最近邻(kNN)方法计算用户的情感百分比。我们提出的模型解决了情感检测的问题,这一点已由近乎真实的结果证明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号