Venue: International Joint Conference on Natural Language Processing

Telling the Whole Story: A Manually Annotated Chinese Dataset for the Analysis of Humor in Jokes


Abstract

Humor plays an important role in human communication, which makes it an important problem for natural language processing. Prior work on the analysis of humor focuses on whether a text is humorous or not, or on its degree of funniness, but this is insufficient to explain why it is funny. We therefore create a dataset on humor with 9,123 manually annotated jokes in Chinese. We propose a novel annotation scheme to describe the scenarios in which humor arises in text. Specifically, our annotations of linguistic humor contain not only the degree of funniness, as in previous work, but also the key words that trigger humor as well as the character relationship, scene, and humor categories. We report reasonable agreement between annotators. We also conduct an analysis and exploration of the dataset. To the best of our knowledge, we are the first to approach humor annotation as a way to explore the underlying mechanism of humor use, which may contribute to a significantly deeper analysis of humor. We also contribute a scarce and valuable dataset, which we will release publicly.
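
The annotation layers listed in the abstract (degree of funniness, trigger key words, character relationship, scene, humor categories) could be held in a record like the one below. This is only an illustrative sketch: the class name, field names, value types, and the example values are assumptions, not the dataset's actual schema or label set.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class JokeAnnotation:
    """Hypothetical record for one annotated joke; all names are illustrative."""
    text: str
    funniness: int  # degree of funniness; the scale is an assumption
    trigger_keywords: List[str] = field(default_factory=list)
    character_relationship: str = ""
    scene: str = ""
    humor_categories: List[str] = field(default_factory=list)

    def keywords_in_text(self) -> bool:
        """Return True if every trigger key word occurs in the joke text."""
        return all(kw in self.text for kw in self.trigger_keywords)


# Placeholder values only; these are not taken from the dataset.
example = JokeAnnotation(
    text="placeholder joke text with a punchline",
    funniness=3,
    trigger_keywords=["punchline"],
    character_relationship="friends",
    scene="school",
    humor_categories=["wordplay"],
)
```

A check such as `keywords_in_text` is one simple consistency test a reader might run after loading annotations, since trigger key words are expected to be spans of the joke itself.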
