International Joint Conference on Natural Language Processing; Conference on Empirical Methods in Natural Language Processing

Telling the Whole Story: A Manually Annotated Chinese Dataset for the Analysis of Humor in Jokes



Abstract

Humor plays an important role in human communication, which makes it an important problem for natural language processing. Prior work on the analysis of humor focuses on whether text is humorous or not, or on the degree of funniness, but this is insufficient to explain why it is funny. We therefore create a dataset on humor with 9,123 manually annotated jokes in Chinese. We propose a novel annotation scheme to give scenarios of how humor arises in text. Specifically, our annotations of linguistic humor not only contain the degree of funniness, like previous work, but also contain key words that trigger humor as well as character relationship, scene, and humor categories. We report reasonable agreement between annotators. We also conduct an analysis and exploration of the dataset. To the best of our knowledge, we are the first to approach humor annotation for exploring the underlying mechanism of the use of humor, which may contribute to a significantly deeper analysis of humor. We also contribute a scarce and valuable dataset, which we will release publicly.
