首页> 外文会议>Text Analysis Conference >Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance
【24h】

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

机译:TAC 2009年清华大学:通过信息距离总结多文件

获取原文

摘要

This paper presents our extractive summarization systems at the update summarization track of TAC 2009. This system is based on our newly developed document summarization framework under the theory of conditional information distance among many objects. The best summary is defined in this paper to be the one which has the minimum information distance to the entire document set. The best update summary has the minimum conditional information distance to a document cluster given that a prior document cluster has already been read. Experiments on the TAC dataset have proved that our method has got a good performance in many categories.
机译:本文提出了TAC 2009年更新摘要轨道的推动摘要系统。该系统基于许多物体之间有条件信息距离理论下的新开发的文件摘要框架。在本文中定义了最佳摘要,是与整个文档集的最小信息距离的最佳摘要。鉴于已经读取了先前的文档群集,最佳更新摘要具有到文档群集的最小条件信息距离。 TAC数据集的实验证明,我们的方法在许多类别中具有良好的表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号