

That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets



Abstract

We propose a novel data augmentation approach to enhance computational behavioral analysis using social media text. In particular, we collect a Twitter corpus of the descriptions of annoying behaviors using the #petpeeve hashtags. In the qualitative analysis, we study the language use in these tweets, with a special focus on the fine-grained categories and the geographic variation of the language. In quantitative analysis, we show that lexical and syntactic features are useful for automatic categorization of annoying behaviors, and frame-semantic features further boost the performance; that leveraging large lexical embeddings to create additional training instances significantly improves the lexical model; and incorporating frame-semantic embedding achieves the best overall performance.
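The abstract does not spell out how lexical embeddings are used to create additional training instances. One common reading is nearest-neighbor substitution: swap a word in a labeled tweet for a close neighbor in embedding space and keep the original label. The sketch below illustrates that idea only. The toy vectors, the label name, the function names, and the `min_sim` threshold are all assumptions made for illustration and are not taken from the paper.

```python
import math

# Toy 3-dimensional embeddings. The values are invented for illustration;
# a real system would load large pretrained word vectors instead.
TOY_EMBEDDINGS = {
    "loud": [0.9, 0.1, 0.0],
    "noisy": [0.8, 0.2, 0.1],
    "quiet": [-0.7, 0.3, 0.0],
    "chewing": [0.1, 0.9, 0.2],
    "eating": [0.2, 0.8, 0.3],
    "people": [0.0, 0.1, 0.9],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def nearest_neighbors(word, embeddings, k=1, min_sim=0.9):
    """Return up to k other words whose similarity to `word` is >= min_sim."""
    if word not in embeddings:
        return []
    target = embeddings[word]
    scored = [
        (cosine(target, vec), other)
        for other, vec in embeddings.items()
        if other != word
    ]
    scored.sort(reverse=True)
    return [other for sim, other in scored[:k] if sim >= min_sim]


def augment(tokens, label, embeddings, k=1, min_sim=0.9):
    """Create new labeled instances by replacing one token at a time."""
    new_instances = []
    for i, token in enumerate(tokens):
        for neighbor in nearest_neighbors(token, embeddings, k, min_sim):
            new_tokens = tokens[:i] + [neighbor] + tokens[i + 1:]
            new_instances.append((new_tokens, label))
    return new_instances


# "hypothetical_label" stands in for one of the behavior categories.
examples = augment(["loud", "chewing", "people"], "hypothetical_label", TOY_EMBEDDINGS)
for tokens, label in examples:
    print(" ".join(tokens), "->", label)
```

The similarity threshold is a design choice in this sketch: without it, every in-vocabulary word would be replaced by its closest neighbor even when that neighbor is unrelated, which would add noisy training instances.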


Bibliographic record

Source
Conference on Empirical Methods in Natural Language Processing, 2015, pp. 2557-2563 (7 pages)
Authors
William Yang Wang; Diyi Yang
Original format: PDF

Similar documents


1. Automatic Spectroscopic Data Categorization by Clustering Analysis (ASCLAN): A Data-Driven Approach for Distinguishing Discriminatory Metabolites for Phenotypic Subclasses [J]. Zou Xin, Holmes Elaine, Nicholson Jeremy K. Analytical Chemistry, 2016, no. 11.
2. SegChainW2V: Towards a Generic Automatic Video Segmentation Framework, Based on Lexical Chains of Audio Transcriptions and Word Embeddings [J]. Adrian-Gabriel Chifu, Sébastien Fournier. Procedia Computer Science, 2016, no. 1.
3. Detecting misogyny in Spanish tweets. An approach based on linguistics features and word embeddings [J]. Jose Antonio Garcia-Diaz, Mar Canovas-Garcia, Ricardo Colomo-Palacios. Future Generation Computer Systems, 2021, no. Jana.
4. That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets [C]. William Yang Wang, Diyi Yang. Conference on Empirical Methods in Natural Language Processing, 2015.
5. Neural and behavioral correlates of similarity-based categorization: An event-related potential approach [D]. Azizian, Allen. 2004.
6. Automatic CNN-based detection of cardiac MR motion artefacts using k-space data augmentation and curriculum learning [O]. Ilkay Oksuz, Bram Ruijsink, Esther Puyol-Antón.
7. That’s So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets [O]. William Yang Wang, Diyi Yang. 2015.
