首页> 外文期刊>高技术通讯(英文版) >Summarization based on physical features and logical structure of multi documents
【24h】

Summarization based on physical features and logical structure of multi documents

机译:基于多文档物理特征和逻辑结构的汇总

获取原文
获取原文并翻译 | 示例
       

摘要

With the rapid development of the Internet, multi documents summarization is becoming a very hot research topic. In order to generate a summarization that can effectively characterize the original information from documents, this paper proposes a multi documents summarization approach based on the physical features and logical structure of the document set. This method firstly clusterssimilar sentences into several Logical Topics (LTs), and then orders these topics according to their physical features of multi documents. After that, sentences used for the summarization are extracted from these LTs, and finally the summarization is generated via certain sorting algorithms. Our experiments show that the information coverage rate of our method is 8.83% higher than those methods based solely on logical structures, and 14.31% higher than Top-N method.
机译:随着Internet的飞速发展,多文档摘要已成为一个非常热门的研究主题。为了生成能够有效地描述文档中原始信息的摘要,本文提出了一种基于文档集的物理特征和逻辑结构的多文档摘要方法。该方法首先将相似的句子聚类为几个逻辑主题(LTs),然后根据它们在多文档中的物理特征对这些主题进行排序。之后,从这些LT中提取用于摘要的语句,最后通过某些排序算法生成摘要。实验表明,该方法的信息覆盖率比仅基于逻辑结构的方法高8.83%,比Top-N方法高14.31%。

著录项

  • 来源
    《高技术通讯(英文版)》 |2005年第2期|133-136|共4页
  • 作者

    Qin Bing; Liu Ting; Li Sheng;

  • 作者单位

    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, P.R.China;

    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, P.R.China;

    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, P.R.China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 chi
  • 中图分类
  • 关键词

    multi documents summarization; logical topics; physical features;

    机译:多文档摘要;逻辑主题;物理特征;
  • 入库时间 2022-08-19 03:39:34
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号