Venue: 2017 ACM/IEEE Joint Conference on Digital Libraries

Building Entity-Centric Event Collections

Abstract

Web archives preserve an unprecedented abundance of materials regarding major events and transformations in our society. In this paper, we present an approach for building event-centric sub-collections from such large archives. These sub-collections include not only the core documents related to the event itself but, even more importantly, documents describing related aspects (e.g., premises and consequences). This is achieved by 1) identifying relevant concepts and entities from a knowledge base, and 2) detecting their mentions in documents, which are interpreted as indicators of relevance. We extensively evaluate our system on two diachronic corpora, the New York Times Corpus and the US Congressional Record, and we test its performance on the TREC KBA Stream corpus, a large and publicly available web archive.
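
The two steps named in the abstract (collecting event-related entities, then treating their mentions as relevance indicators) can be illustrated with a minimal sketch. This is not the authors' system: the entity list stands in for a real knowledge-base lookup, and the names `EVENT_ENTITIES`, `count_mentions`, `relevance_score`, and `build_collection`, as well as the coverage-based score and the 0.5 threshold, are all assumptions made for illustration.

```python
import re

# Hypothetical stand-in for step 1 (knowledge-base lookup): entities and
# surface forms assumed to be related to one example event. Illustrative only.
EVENT_ENTITIES = {
    "Hurricane Katrina": ["Hurricane Katrina", "Katrina"],
    "New Orleans": ["New Orleans"],
    "FEMA": ["FEMA", "Federal Emergency Management Agency"],
    "levee": ["levee", "levees"],
}


def count_mentions(text, entities=EVENT_ENTITIES):
    """Step 2: count whole-word, case-insensitive mentions per entity."""
    counts = {}
    for entity, surface_forms in entities.items():
        # Longest forms first, so "Hurricane Katrina" is matched once
        # instead of also being counted as a separate "Katrina" mention.
        ordered = sorted(surface_forms, key=len, reverse=True)
        pattern = r"\b(?:" + "|".join(re.escape(f) for f in ordered) + r")\b"
        counts[entity] = len(re.findall(pattern, text, flags=re.IGNORECASE))
    return counts


def relevance_score(text, entities=EVENT_ENTITIES):
    """Assumed score: fraction of the event's entities mentioned at least once."""
    counts = count_mentions(text, entities)
    mentioned = sum(1 for n in counts.values() if n > 0)
    return mentioned / len(entities)


def build_collection(documents, threshold=0.5, entities=EVENT_ENTITIES):
    """Keep the ids of documents whose score reaches the (assumed) threshold."""
    return [
        doc_id
        for doc_id, text in documents.items()
        if relevance_score(text, entities) >= threshold
    ]


if __name__ == "__main__":
    docs = {
        "a": "FEMA was criticized after Hurricane Katrina flooded New Orleans.",
        "b": "The city council met to discuss parking fees.",
        "c": "Engineers inspected the levees.",
    }
    for doc_id, text in docs.items():
        print(doc_id, relevance_score(text))
    print(build_collection(docs))
```

A coverage-style score is used here only because it is easy to follow. A real pipeline would more plausibly rely on entity linking rather than string matching, and on weighted entity relatedness rather than a flat count.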