Document key phrase extraction method

Abstract

A computer-implemented method of extracting key phrases from a document is disclosed. The method comprises the steps of: accessing a repository of linked subjects, the repository comprising first and second data structures that represent the relationships between said subjects using different representation criteria; pruning the first data structure by removing links between subjects based on a further relationship between said subjects in the second data structure; matching phrases in said document to subjects in the pruned first data structure; further pruning the pruned first data structure by removing unmatched subjects that are not linked to matched subjects; determining a ranking for each matched subject; and selecting key phrases using the determined subject rankings. A computer program that implements these steps when executed on a computer is also disclosed.
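
The sequence of steps in the abstract can be illustrated with a small sketch. It is not the patented implementation: the abstract does not say what the two data structures are, which "further relationship" drives the first pruning, or how subjects are ranked. The sketch assumes, purely for illustration, that the first structure is a link graph, the second is a mapping from subjects to category labels, a link is dropped when its two subjects share no category, phrases are matched by case-insensitive substring search, and rank is the number of links a matched subject retains. The function name `extract_key_phrases`, its parameters, and the sample data are all hypothetical.

```python
from collections import defaultdict


def extract_key_phrases(document, links, categories, top_k=3):
    """Illustrative sketch of the abstract's steps (all criteria are assumptions).

    links:      subject -> set of linked subjects (stands in for the first structure)
    categories: subject -> set of category labels (stands in for the second structure)
    """
    # Step 1: prune the first structure. Assumed criterion: drop a link
    # when the two subjects share no category in the second structure.
    pruned = defaultdict(set)
    for a, neighbours in links.items():
        pruned[a]  # keep the subject even if it loses every link
        for b in neighbours:
            if categories.get(a, set()) & categories.get(b, set()):
                pruned[a].add(b)
                pruned[b].add(a)

    # Step 2: match phrases in the document to subjects
    # (assumed: case-insensitive substring match on the subject name).
    text = document.lower()
    matched = {s for s in pruned if s.lower() in text}

    # Step 3: prune again, removing unmatched subjects that are not
    # linked to any matched subject.
    kept = {s for s in pruned if s in matched or pruned[s] & matched}
    graph = {s: pruned[s] & kept for s in kept}

    # Step 4: rank each matched subject (assumed: count of remaining links).
    ranking = {s: len(graph[s]) for s in matched}

    # Step 5: select key phrases by rank, breaking ties alphabetically.
    ordered = sorted(matched, key=lambda s: (-ranking[s], s))
    return ordered[:top_k]


# Hypothetical sample repository and document.
links = {
    "machine learning": {"neural network", "statistics", "banana"},
    "neural network": {"machine learning", "deep learning"},
    "statistics": {"machine learning"},
    "deep learning": {"neural network"},
    "banana": {"machine learning"},
    "cooking": {"banana"},
}
categories = {
    "machine learning": {"cs"},
    "neural network": {"cs"},
    "deep learning": {"cs"},
    "statistics": {"math", "cs"},
    "banana": {"food"},
    "cooking": {"food"},
}
document = "Machine learning with a neural network relies on statistics."

print(extract_key_phrases(document, links, categories, top_k=2))
```

In this toy data the link between "machine learning" and "banana" is removed in the first pruning because the two subjects share no category, and "banana" and "cooking" are removed in the second pruning because neither is matched nor linked to a matched subject. "deep learning" survives as an unmatched subject linked to a matched one, so it still contributes to the rank of "neural network".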
