首页> 外文会议>International conference on language resources and evaluation >Collecting Humorous Expressions from a Community-based Question-answering-service Corpus
【24h】

Collecting Humorous Expressions from a Community-based Question-answering-service Corpus

机译:从基于社区的问答服务语料库中收集幽默的表达

获取原文

摘要

We proposed a method of collecting humorous expressions from an online community-based question-answering (CQA) corpus where some users post a variety of questions and other users post relevant answers. Although the service is created for the purpose of knowledge exchange, there are users who enjoy posting humorous responses. Therefore, the corpus contains many interesting humour communication examples that might be useful in understanding the nature of online communications and variations in humour. Considering the size of 3,116,009 topics, it is necessary to introduce automation in the collection process. However, due to the context dependency of humour expressions, it is hard to collect them automatically by using keywords or key phrases. Our method uses natural language processing based on dissimilarity criteria between answer texts. By using this method, we can collect humour expressions more efficiently than by manual exploration: 30 times more examples per hour.
机译:我们提出了一种从在线的基于社区的问答(CQA)语料库中收集幽默表达的方法,其中一些用户发布各种问题,而其他用户发布相关的答案。尽管创建服务的目的是为了进行知识交流,但仍有一些用户喜欢张贴幽默的回复。因此,语料库包含许多有趣的幽默交流示例,这些示例可能有助于理解在线交流的本质和幽默感。考虑到3,116,009个主题的大小,有必要在收集过程中引入自动化。但是,由于幽默表达的上下文相关性,很难通过使用关键字或关键短语自动收集它们。我们的方法基于答案文本之间的相似性标准使用自然语言处理。通过使用这种方法,我们可以比通过手工探索更有效地收集幽默表达:每小时多30倍的示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号