'When Numbers Matter!': Detecting Sarcasm in Numerical Portions of Text


Abstract

Research in sarcasm detection spans almost a decade. However, a particular form of sarcasm remains unexplored: sarcasm expressed through numbers, which, we estimate, forms about 11% of the sarcastic tweets in our dataset. The sentence 'Love waking up at 3 am' is sarcastic because of the number. In this paper, we focus on detecting sarcasm arising out of numbers in tweets. Initially, to gain insight into the problem, we implement a rule-based classifier and a statistical machine learning (ML) classifier. The rule-based classifier conveys the crux of the numerical sarcasm problem, namely, incongruity arising out of numbers. The statistical ML classifier uncovers the indicators, i.e., features, of such sarcasm. The actual system, however, consists of two deep learning (DL) models, a CNN and an attention network, which obtain F-scores of 0.93 and 0.91, respectively, on our dataset of tweets containing numbers. To the best of our knowledge, this is the first line of research investigating the phenomenon of sarcasm arising out of numbers, culminating in a detector thereof.


Bibliographic details

Source
10th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis | 2019 | pp. 72-80 | 9 pages
Conference location: Minneapolis (US)
Authors
Abhijeet Dubey; Lakshya Kumar; Arpan Somani; Aditya Joshi; Pushpak Bhattacharyya
Author affiliations

IIT Bombay;

AI Research Einstein Salesforce;

Big Data Labs American Express;

CSIRO;

IIT Bombay;

Original format: PDF
Language: English

Similar literature

1. A pragmatic and intelligent model for sarcasm detection in social media text [J]. Shrivastava Mayank, Kumar Shishir. Technology in society. 2021, Issue Feba
2. Patent Issued for Discovery of Parallel Text Portions in Comparable Collections of Corpora and Training Using Comparable Texts [J]. Robotics and Machine Learning. 2012, Issue 45
3. Detecting the target of sarcasm is hard: Really?? [J]. Pradeesh Parameswaran, Andrew Trotman, Veronica Liesaputra. Information Processing & Management. 2021, Issue 4
4. 'When Numbers Matter!': Detecting Sarcasm in Numerical Portions of Text [C]. Abhijeet Dubey, Lakshya Kumar, Arpan Somani. Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies. 2019
5. The effects of physical and social activity on the elderly's ability to detect sarcasm [D]. Henderson, Amy M. 2004
6. White Matter Tracts Critical for Recognition of Sarcasm [O]. Cameron Davis, Kenichi Oishi, Andreia Faria
7. How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text [O]. Aditya Joshi, Pushpak Bhattacharyya, Mark Carman. 2016
