首页> 外文会议>IEEE/WIC/ACM International Conference on Web Intelligence >Improving the Classification of Drunk Texting in Tweets Using Semantic Enrichment

【24h】

Improving the Classification of Drunk Texting in Tweets Using Semantic Enrichment

机译：利用语义丰富来改善推文中醉酒短信的分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Excessive alcohol consumption is a worldwide problem, and social networks such as Twitter can provide valuable data that help understanding factors related to alcoholism, particularly among youngsters. The identification of drunk tweets (i.e. posted under the influence of alcohol) is complex because tweets are short, sparse and written with diverse and internet specific vocabulary, possibly with errors due to alcohol influence. In this paper, we propose an enriching framework that integrates conceptual and semantic features that expand and generalize the vocabulary, providing context to tweet terms. It also handles misspellings and the selection of discriminative features resulting from contextual enrichment. We outperformed the baseline, achieving improvements of 13.79 percentage points in recall, with no significant harm to precision. We illustrate the value of drunk tweets classification by developing an exploratory analysis that reveals drunk tweeters demographics and tweet properties.

机译：过度饮酒是一个全球性的问题，Twitter等社交网络可以提供有价值的数据，帮助了解与酗酒有关的因素，尤其是在年轻人中。酒后鸣叫的识别（即在酒精的影响下发布）很复杂，因为鸣叫简短，稀疏，并且使用多种多样且特定于互联网的词汇书写，可能由于酒精的影响而产生错误。在本文中，我们提出了一个丰富的框架，该框架整合了概念和语义功能，这些功能扩展和概括了词汇表，并为推文术语提供了上下文。它还可以处理拼写错误以及因上下文丰富而导致的区别性特征的选择。我们的表现优于基准，召回率提高了13.79个百分点，对精度没有明显的损害。我们通过开发探索性分析来说明醉酒式推文分类的价值，该分析揭示了醉酒的高音扬声器的人口统计和推文特性。

著录项

来源
《IEEE/WIC/ACM International Conference on Web Intelligence 》|2018年|190-197|共8页
会议地点
作者
Marcos Grzeça; Karin Becker; Renata Galante;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Semantics; Vocabulary; Twitter; Data mining; Alcoholic beverages;

机译：特征提取;语义;词汇; Twitter;数据挖掘;含酒精饮料;

相似文献

外文文献
中文文献
专利

1. Drink2Vec: Improving the classification of alcohol-related tweets using distributional semantics and external contextual enrichment [J] . Marcos Grzeca, Karin Becker, Renata Galante Information Processing & Management . 2020 ,第6期

机译：drink2vec：使用分布语义和外部上下文富集改进酒精相关的推文的分类
2. A framework for event classification in tweets based on hybrid semantic enrichment [J] . Romero Simone, Becker Karin Expert Systems with Application . 2019 ,第MARa期

机译：基于混合语义丰富的推文事件分类框架
3. Text classification with semantically enriched word embeddings [J] . N. Pittaras, G. Giannakopoulos, G. Papadakis, Natural language engineering . 2021 ,第Pta4期

机译：用语义丰富的单词嵌入文本分类
4. Experiments with Semantic Enrichment for Event Classification in Tweets [C] . Simone Aparecida Pinto Romero, Karin Becker IEEE/WIC/ACM International Conference on Web Intelligence . 2016

机译：推文中用于事件分类的语义丰富实验
5. Methods of Enriching Domain Knowledge with Universal Semantics for Higher Text Mining Performance [D] . Qazanfari, Kazem . 2020

机译：以普通语义丰富域知识的方法，以获得更高的文本挖掘性能
6. Text Semantic Classification of Long Discourses Based on Neural Networks with Improved Focal Loss [O] . Dan Jiang, Jin He 2021

机译：基于神经网络的神经网络文本语义分类改善焦损
7. Text mining with semantic annotation : using enriched text representation for entity-oriented retrieval, semantic relation identification and text clustering [O] . Hou Jun 2014

机译：具有语义注释的文本挖掘：使用丰富的文本表示法进行面向实体的检索，语义关系识别和文本聚类

Improving the Classification of Drunk Texting in Tweets Using Semantic Enrichment

摘要

著录项

相似文献

相关主题

期刊订阅