Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

'President Vows to Cut Taxes Hair': Dataset and Analysis of Creative Text Editing for Humorous Headlines


Abstract

We introduce, release, and analyze a new dataset, called Humicroedit, for research in computational humor. Our publicly available data consists of regular English news headlines paired with versions of the same headlines that contain simple replacement edits designed to make them funny. We carefully curated crowdsourced editors to create funny headlines and judges to score a total of 15,095 edited headlines, with five judges per headline. The simple edits, usually just a single word replacement, mean we can apply straightforward analysis techniques to determine what makes our edited headlines humorous. We show how the data support classic theories of humor, such as incongruity, superiority, and setup/punchline. Finally, we develop baseline classifiers that can predict whether or not an edited headline is funny, which is a first step toward automatically generating humorous headlines as an approach to creating topical humor.