Generating an Entailment Corpus from News Headlines

机译：从新闻头条生成蕴涵语料库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe our efforts to generate alarge (100,000 instance) corpus of textualentailment pairs from the lead paragraphand headline of news articles. We manuallyinspected a small set of news storiesin order to locate the most productivesource of entailments, then built an annotationinterface for rapid manual evaluationof further exemplars. With thistraining data we built an SVM-baseddocument classifier, which we used forcorpus refinement purposes—we believethat roughly three-quarters of the resultingcorpus are genuine entailment pairs. Wealso discuss the difficulties inherent inmanual entailment judgment, and suggestways to ameliorate some of these.

机译：我们描述了我们为产生大型（100,000个实例）文本语料库引言段中的蕴含对和新闻标题。我们手动检查了一小堆新闻故事为了找到最有生产力的需求的来源，然后建立注释快速手动评估的界面进一步的例子。有了这个训练数据，我们建立了一个基于SVM的文档分类器，我们用于语料库细化目的-我们相信大约四分之三的结果语料库是真正的蕴含对。我们还讨论了固有的困难人工需求判断，并提出建议改善其中一些的方法。

著录项

来源
《43rd Annual Meeting of the Association for Computational Linguistics: Proceeding of the Conference》|2005年|49-54|共6页
会议地点
作者
John Burger; Lisa Ferro;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Analysing headlines as a way of downsizing news corpora: Evidence from an Arabic-English comparable corpus of newspaper articles [J] . Haider Ahmad S., Hussein Riyad F. Literary & linguistic computing . 2020,第4期

机译：分析头条新闻作为缩小新闻学习的方式：来自阿拉伯语 - 英语的证据报纸文章
2. ANALYSIS: Aramco reheats old news in bid to generate helpful headlines before IPO [J] . Gas Matters Today group Gas Matters Today . 2019,第Octa30期

机译：分析：Aramco在竞标之前重新加热旧新闻，以在IPO之前产生乐于助人的头条新闻
3. Computer-generated Conversation Using Newspaper Headline [J] . Eriko Yoshimura, Misako Imono, Seiji Tsuchiya, 计算机技术与应用：英文 . 2013,第008期

机译：利用报纸头条进行的计算机对话
4. GoodNewsEveryone: A Corpus of News Headlines Annotated with Emotions, Semantic Roles, and Reader Perception [C] . Laura Bostan, Evgeny Kim, Roman Klinger International Conference on Language Resources and Evaluation . 2020

机译：Goodnewseveryone：一种新闻头条的语料库，用情感，语义角色和读者感知诠释
5. Knowing Her Name: The Framing of Sexual Assault Victims and Assailants in News Media Headlines [D] . Webb, Tessa. 2020

机译：知道她的名字：新闻媒体头条新闻的性侵犯受害者和攻击者的框架
6. The SFU Opinion and Comments Corpus: A Corpus for the Analysis of Online News Comments [O] . Varada Kolhatkar, Hanhan Wu, Luca Cavasso, -1

机译：SFU意见和评论语料库：分析在线新闻评论的语料库
7. Generating an Entailment Corpus from News Headlines. 2005 [O] . John Burger, Lisa Ferro 2012

机译：从新闻头条生成蕴含语料库。 2005年

Generating an Entailment Corpus from News Headlines

摘要

著录项

相似文献

相关主题

期刊订阅