首页> 外国专利> METHOD AND SYSTEM FOR IDENTIFYING SET OF QUERY-RELEVANT KEYWORDS COMPRISED IN ONE OR MORE DOCUMENTS OBTAINED FROM QUERY, WHICH NEED NOT COMPRISE THE KEYWORDS

METHOD AND SYSTEM FOR IDENTIFYING SET OF QUERY-RELEVANT KEYWORDS COMPRISED IN ONE OR MORE DOCUMENTS OBTAINED FROM QUERY, WHICH NEED NOT COMPRISE THE KEYWORDS

机译:识别由查询获得的一个或多个文档中包含的,与查询相关的关键字集的方法和系统

摘要

PPROBLEM TO BE SOLVED: To provide a system and method for identifying query-related keywords in documents found in a search using latent semantic analysis. PSOLUTION: A term-weight matrix M comprising one or more document term-weight vectors d is created. Each document term-weight vector d comprises information on frequency in one or more documents obtained from terms in a query. An expanded query term-weight vector qSBexpanded/SBis created from a query term-weight vector q and the term-weight matrix M. With the expanded query term-weight vector qSBexpanded/SBand the document term-weight vectors d, a set of keywords identified as terms related to the query and also comprised in at least one of the documents is located. The query need not comprise the keywords. PCOPYRIGHT: (C)2006,JPO&NCIPI
机译:

要解决的问题:提供一种使用潜在语义分析来识别在搜索中找到的文档中与查询相关的关键字的系统和方法。解决方案:创建包含一个或多个文档术语权重向量d的术语权重矩阵M。每个文档术语权重向量d包含有关从查询中的术语获得的一个或多个文档中的频率的信息。根据查询术语权重向量q和术语权重矩阵M创建扩展的查询术语权重向量q expand 。使用扩展的查询术语权重向量q expanded 和文档术语权重矢量d一起定位,这些关键字集被标识为与查询相关的术语,并且也包含在至少一个文档中。该查询不必包含关键字。

版权:(C)2006,JPO&NCIPI

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号