首页> 外文会议>IEEE/WIC/ACM International Conference on Web Intelligence >Improving the Classification of Drunk Texting in Tweets Using Semantic Enrichment
【24h】

Improving the Classification of Drunk Texting in Tweets Using Semantic Enrichment

机译:利用语义丰富来改善推文中醉酒短信的分类

获取原文

摘要

Excessive alcohol consumption is a worldwide problem, and social networks such as Twitter can provide valuable data that help understanding factors related to alcoholism, particularly among youngsters. The identification of drunk tweets (i.e. posted under the influence of alcohol) is complex because tweets are short, sparse and written with diverse and internet specific vocabulary, possibly with errors due to alcohol influence. In this paper, we propose an enriching framework that integrates conceptual and semantic features that expand and generalize the vocabulary, providing context to tweet terms. It also handles misspellings and the selection of discriminative features resulting from contextual enrichment. We outperformed the baseline, achieving improvements of 13.79 percentage points in recall, with no significant harm to precision. We illustrate the value of drunk tweets classification by developing an exploratory analysis that reveals drunk tweeters demographics and tweet properties.
机译:过度饮酒是一个全球性的问题,Twitter等社交网络可以提供有价值的数据,帮助了解与酗酒有关的因素,尤其是在年轻人中。酒后鸣叫的识别(即在酒精的影响下发布)很复杂,因为鸣叫简短,稀疏,并且使用多种多样且特定于互联网的词汇书写,可能由于酒精的影响而产生错误。在本文中,我们提出了一个丰富的框架,该框架整合了概念和语义功能,这些功能扩展和概括了词汇表,并为推文术语提供了上下文。它还可以处理拼写错误以及因上下文丰富而导致的区别性特征的选择。我们的表现优于基准,召回率提高了13.79个百分点,对精度没有明显的损害。我们通过开发探索性分析来说明醉酒式推文分类的价值,该分析揭示了醉酒的高音扬声器的人口统计和推文特性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号