EmoCNN: Encoding Emotional Expression from Text to Word Vector and Classifying Emotions—A Case Study in Thai Social Network Conversation

Konlakorn Wongpatikaseree; Yongyos Kaewpitakkun; Sumeth Yuenyong; Siriwon Matsuo; Panida Yomaboot


Abstract

We present EmoCNN, which combines a specially trained word embedding layer with a convolutional neural network model to classify conversational text into four types of emotion. The model is part of a chatbot for depression evaluation. Classifying emotion from conversational text is difficult because most word embeddings are trained on emotionally neutral corpora such as Wikipedia or news articles, where emotional words appear rarely or not at all and the language style is formal. We trained a new word embedding based on the word2vec architecture in an unsupervised manner and then fine-tuned it on soft-labelled data obtained by mining Twitter with emotion keywords. We show that this emotion word embedding can differentiate between words that have the same polarity and words that have opposite polarity, and can find similar words with the same polarity, whereas the standard word embedding cannot. We then used the new embedding as the first layer of EmoCNN, which classifies conversational text into the four emotions. EmoCNN achieved a macro-averaged F1-score of 0.76 on the test set. We compared EmoCNN against three other models: a shallow fully-connected neural network, a fine-tuned RoBERTa, and ULMFiT. Their best macro-averaged F1-scores were 0.5556, 0.6402, and 0.7386, respectively.
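The abstract reports results as macro-averaged F1-scores. As a minimal sketch of how that metric is usually computed (per-class F1, then an unweighted mean over classes), the following may help. The function name `macro_f1`, the placeholder labels `e1` to `e4`, and the toy predictions are all assumptions made for illustration; the abstract does not name the four emotions, and this is not the authors' evaluation code.

```python
from collections import Counter


def macro_f1(y_true, y_pred, labels):
    """Unweighted mean of per-class F1 scores over `labels`."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(labels)


# Hypothetical placeholder labels and toy data, not taken from the paper.
LABELS = ["e1", "e2", "e3", "e4"]
y_true = ["e1", "e1", "e2", "e2", "e3", "e3", "e4", "e4"]
y_pred = ["e1", "e2", "e2", "e2", "e3", "e3", "e4", "e1"]

print(round(macro_f1(y_true, y_pred, LABELS), 4))
```

Because every class contributes equally to the mean, a model that does poorly on a rare emotion is penalised as much as one that does poorly on a common emotion, which is presumably why the metric suits a four-class emotion task.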

Bibliographic details

Source: Engineering Journal, 2021, No. 7, 10 pages
Original format: PDF
Chinese Library Classification: Building construction
Keywords: emotion classification; sentiment analysis; word embedding

