首页> 外文会议>10th workshop on computational approaches to subjectivity, sentiment and social media analysis >'When Numbers Matter!': Detecting Sarcasm in Numerical Portions of Text
【24h】

'When Numbers Matter!': Detecting Sarcasm in Numerical Portions of Text

机译:“当数字很重要!”:在文本的数字部分中检测讽刺

获取原文
获取原文并翻译 | 示例

摘要

Research in sarcasm detection spans almost a decade. However a particular form of sarcasm remains unexplored: sarcasm expressed through numbers, which we estimate, forms about 11% of the sarcastic tweets in our dataset. The sentence 'Love waking up at 3 am' is sarcastic because of the number. In this paper, we focus on detecting sarcasm in tweets arising out of numbers. Initially, to get an insight into the problem, we implement a rule-based and a statistical machine learning-based (ML) classifier. The rule-based classifier conveys the crux of the numerical sarcasm problem, namely, incongruity arising out of numbers. The statistical ML classifier uncovers the indicators i.e., features of such sarcasm. The actual system in place, however, are two deep learning (DL) models, CNN and attention network that obtains an F-score of 0.93 and 0.91 on our dataset of tweets containing numbers. To the best of our knowledge, this is the first line of research investigating the phenomenon of sarcasm arising out of numbers, culminating in a detector thereof.
机译:嘲讽检测的研究跨越了近十年。然而,尚未发现一种特殊形式的讽刺:通过数字表达的讽刺(据我们估计)构成了我们数据集中讽刺推文的11%。由于数字太多,“爱情在凌晨3点醒来”这句话很讽刺。在本文中,我们着重于检测数字引发的推文中的讽刺。最初,为了深入了解问题,我们实现了基于规则和基于统计机器学习(ML)的分类器。基于规则的分类器传达了数字讽刺问题的症结,即数字引起的不一致。机器学习统计分类器揭示了这种讽刺的指标,即特征。但是,实际的系统是两个深度学习(DL)模型,即CNN和注意力网络,它们在包含数字的推文数据集上获得0.93和0.91的F评分。据我们所知,这是研究数字引起的讽刺现象的最高研究领域,最终达到了一种检测器的作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号