首页> 外国专利> Text summarization using relevance measures and latent semantic analysis

Text summarization using relevance measures and latent semantic analysis

机译:使用相关度量和潜在语义分析的文本摘要

摘要

Text summarizers using relevance measurement technologies and latent semantic analysis techniques provide accurate and useful summarization of the contents of text documents. Generic text summaries may be produced by ranking and extracting sentences from original documents; broad coverage of document content and decreased redundancy may simultaneously be achieved by constructing summaries from sentences that are highly ranked and different from each other. In one embodiment, conventional Information Retrieval (IR) technologies may be applied in a unique way to perform the summarization; relevance measurement, sentence selection, and term elimination may be repeated in successive iterations. In another embodiment, a singular value decomposition technique may be applied to a terms-by-sentences matrix such that all the sentences from the document may be projected into the singular vector space; a text summarizer may then select sentences having the largest index values with the most important singular vectors as part of the text summary.
机译:使用关联性度量技术和潜在语义分析技术的文本摘要生成器可以对文本文档的内容进行准确而有用的汇总。通用文本摘要可以通过对原始文档中的句子进行排名和提取来产生;通过从排名较高且彼此不同的句子中构建摘要,可以同时实现文档内容的广泛覆盖和减少的冗余。在一个实施例中,可以以独特的方式来应用常规的信息检索(IR)技术来执行汇总;然而,在此不赘述。相关性测量,句子选择和术语消除可以在连续的迭代中重复进行。在另一个实施例中,可以将奇异值分解技术应用于逐项词语矩阵,从而可以将来自文档的所有句子都投影到奇异向量空间中。文本摘要器然后可以选择具有最大索引值和最重要的奇异矢量的句子作为文本摘要的一部分。

著录项

  • 公开/公告号US2002138528A1

    专利类型

  • 公开/公告日2002-09-26

    原文格式PDF

  • 申请/专利权人 GONG YIHONG;LIU XIN;

    申请/专利号US20010817591

  • 发明设计人 YIHONG GONG;XIN LIU;

    申请日2001-03-26

  • 分类号G06F17/21;G06F17/27;

  • 国家 US

  • 入库时间 2022-08-22 00:51:56

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号