【24h】

Summarizing Definition from Wikipedia

机译:总结维基百科的定义

获取原文

摘要

Wikipedia provides a wealth of knowledge, where the first sentence, infobox (and relevant sentences), and even the entire document of a wiki article could be considered as diverse versions of summaries (definitions) of the target topic. We explore how to generate a series of summaries with various lengths based on them. To obtain more reliable associations between sentences, we introduce wiki concepts according to the internal links in Wikipedia. In addition, we develop an extended document concept lattice model to combine wiki concepts and non-textual features such as the outline and infobox. The model can concatenate representative sentences from non-overlapping salient local topics for summary generation. We test our model based on our annotated wiki articles which topics come from TREC-QA 2004-2006 evaluations. The results show that the model is effective in summarization and definition QA.
机译:维基百科提供了丰富的知识,其中第一句话,信息框(和相关句子),甚至是Wiki文章的整个文档,都可以视为目标主题摘要(定义)的不同版本。我们探索如何基于它们生成一系列具有各种长度的摘要。为了获得句子之间更可靠的关联,我们根据Wikipedia中的内部链接介绍Wiki概念。此外,我们开发了扩展的文档概念格模型,以结合Wiki概念和非文本功能(例如,大纲和信息框)。该模型可以将非重叠显着本地主题中的代表性句子连接起来,以进行摘要生成。我们根据带注释的Wiki文章测试模型,该文章来自TREC-QA 2004-2006评估。结果表明,该模型在总结和定义质量保证方面是有效的。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号