首页> 外国专利> Iterative technique for phrase query formation and an information retrieval system employing same

Iterative technique for phrase query formation and an information retrieval system employing same

机译:短语查询形成的迭代技术及采用该迭代技术的信息检索系统

摘要

An information retrieval system and method are provided in which an operator inputs (110) one or more query words which are used to determine a search key (120) for searching (130) through a corpus of documents, and which returns ( 140) any matches between the search key and the corpus of documents as a phrase containing the word data matching the search key (the query word(s)), a non-stop (content) word next adjacent to the matching word data, and all intervening stop-words between the matching word data and the next adjacent non-stop word. The operator, after reviewing one or more of the returned phrases can then use one or more of the next adjacent non-stop-words as new query words to reformulate the search key ( 150, 160, 170) and perform a subsequent search through the document corpus. This process can be conducted iteratively, until the appropriate documents of interest are located. The additional non-stop-words from each phrase are preferably aligned with each other (e.g., by columnation) to ease viewing of the " new" content words. IMAGE
机译:提供了一种信息检索系统和方法,其中操作员输入(110)一个或多个查询词,该查询词用于确定用于通过文档全集进行搜索(130)的搜索关键字(120),并返回(140)任意一个。在搜索关键字和文档语料库之间进行匹配,以作为短语,其中包含与搜索关键字(查询词)匹配的单词数据,与该匹配单词数据相邻的不间断(内容)单词以及所有中间的单词匹配的单词数据和下一个相邻的非停止单词之间的单词。在查看了一个或多个返回的短语之后,操作员可以将一个或多个下一个相邻的非停止词用作新查询词,以重新构造搜索关键字(150、160、170)并通过文档语料库。此过程可以迭代进行,直到找到合适的相关文档为止。来自每个短语的另外的不停词优选地彼此对准(例如,通过分栏),以易于观看“新”内容词。 <图像>

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号