Augmenting Semantic Representation of Depressive Language: From Forums to Microblogs

机译：增强压抑语言的语义表示：从论坛到微博

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We discuss and analyze the process of creating word embedding feature representations specifically designed for a learning task when annotated data is scarce, like depressive language detection from Tweets. We start from rich word embedding pre-trained from a general dataset, then enhance it with embedding learned from a domain specific but relatively much smaller dataset. Our strengthened representation portrays better the domain of depression we are interested in as it combines the semantics learned from the specific domain and word coverage from the general language. We present a comparative analyses of our word embedding representations with a simple bag-of-words model, a well known sentiment lexicon, a psycholinguistic lexicon, and a general pre-trained word embedding, based on their efficacy in accurately identifying depressive Tweets. We show that our representations achieve a significantly better F1 score than the others when applied to a high quality dataset.

机译：我们讨论并分析了在注释数据稀缺时（例如从推文中检测到压抑性语言）为学习任务而专门设计的单词嵌入特征表示的过程。我们从从通用数据集中预训练的富词嵌入开始，然后通过从特定于领域但相对较小的数据集中学习到的嵌入来增强它。我们增强的表示法更好地描绘了我们感兴趣的抑郁症领域，因为它结合了从特定领域中学习到的语义和从通用语言中获得的单词覆盖率。我们基于简单的词袋模型，知名的情感词典，心理语言词典和一般的预训练单词嵌入，对它们的词嵌入表示形式进行比较分析，基于它们在准确识别抑郁性推文中的功效。我们表明，当将这些表示应用于高质量数据集时，它们的F1得分明显优于其他表示。

著录项

来源
《European conference on machine learning and principles and practice of knowledge discovery in databases》|2019年|359-375|共17页
会议地点
作者
Nawshad Farruque; Osmar Zaiane; Randy Goebel;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Machine learning; Natural language processing; Distributional semantics; Major Depressive Disorder; Social media;

机译：机器学习;自然语言处理;分布语义;严重抑郁症;社交媒体;

相似文献

外文文献
中文文献
专利

1. Learning Semantic Representations from Directed Social Links to Tag Microblog Users at Scale [J] . ACM Transactions on Information Systems . 2020,第2期

机译：从定向社交链接到标记微博用户的大规模学习语义表示
2. Learning Semantic Representations from Directed Social Links to Tag Microblog Users at Scale [J] . Ecological restoration . 2020,第2期

机译：从指示的社交链接学习语义表示，以标记MicroBlog用户在比例下
3. Building associated semantic representation model for the ultra-short microblog text jumping in big data [J] . Zhang Shunxiang, Wang Yin, Zhang Shiyao, Cluster computing . 2016,第3期

机译：为大数据中超短微博文本的跳转建立关联的语义表示模型
4. Social Tension Detection and Intention Recognition Using Natural Language Semantic Analysis: On the Material of Russian-Speaking Social Networks and Web Forums [C] . Vybornova Olga, Smirnov Ivan, Sochenkov Ilya, 2011 European Intelligence and Security Informatics Conference . 2011

机译：使用自然语言语义分析的社会张力检测和意图识别：讲俄语的社交网络和Web论坛的材料
5. A composite semantic communications framework for representation of agent communication language semantics. [D] . Harper, Lois. 2006

机译：用于代理通信语言语义表示的复合语义通信框架。
6. Identifying bilingual semantic neural representations across languages [O] . Augusto Buchweitz, Svetlana V. Shinkareva, Robert A. Mason, -1

机译：识别跨语言的双语语义神经表示
7. Online forums in critical and reflective training of foreign language teachers: a critical thinking representation in phases in/by language [O] . Rozenfeld, Cibele Cecilio De Faria [UNESP] 2014

机译：外语老师的批判性和反思性培训在线论坛：分阶段/按语言进行批判性思维表示
8. Augmented Role Filling Capabilities for Semantic Interpretation of Spoken Language. [R] . Norton, L., Linebarger, M., Dahl, D., 1991

机译：口语语义解释的增强角色填充能力。

Augmenting Semantic Representation of Depressive Language: From Forums to Microblogs

摘要

著录项

相似文献

相关主题

期刊订阅