首页> 外文会议>Workshop on Figurative Language Processing >Sarcasm Detection using Context Separators in Online Discourse
【24h】

Sarcasm Detection using Context Separators in Online Discourse

机译:使用在线话语中使用上下文分隔符的讽刺检测

获取原文

摘要

Sarcasm is an intricate form of speech, where meaning is conveyed implicitly. Being a convoluted form of expression, detecting sarcasm is an assiduous problem. The difficulty in recognition of sarcasm has many pitfalls, including misunderstandings in everyday communications, which leads us to an increasing focus on automated sarcasm detection. In the second edition of the Figurative Language Processing (FigLang 2020) workshop, the shared task of sarcasm detection released two datasets, containing responses along with their context sampled from Twitter and Reddit. In this work, we use RoBERTa_(large) to detect sarcasm in both the datasets. We further assert the importance of context in improving the performance of contextual word embedding based models by using three different types of inputs - Response-only, Context-Response, and Context-Response (Separated). We show that our proposed architecture performs competitively for both the datasets. We also show that the addition of a separation token between context and target response results in an improvement of 5.13% in the F1-score in the Reddit dataset.
机译:讽刺是一种复杂的语音形式,其中含义隐含地传达。是一种追溯的表达形式,检测讽刺是一个孜孜不倦的问题。认识到讽刺的难度有许多陷阱,包括日常通信中的误解,这导致我们越来越重视自动讽刺检测。在比喻语言处理的第二版(杰出2020)的工作室中,讽刺检测的共享任务释放了两个数据集,其中包含与从Twitter和Reddit采样的上下文的响应。在这项工作中,我们使用Roberta_(大)来检测两个数据集中的讽刺。我们通过使用三种不同类型的输入来提高基于模型的语境词嵌入模型的性能的重要性 - 仅限响应,上下文 - 响应和上下文 - 响应(分离)来提高基于语境字的模型的性能。我们表明我们所提出的架构竞争性地对数据集进行竞争性。我们还表明,在Reddit数据集中的F1分数中增加了上下文和目标响应之间的分离令牌导致5.13%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号