首页> 外国专利> Method for for multilingual information retrieval

Method for for multilingual information retrieval

机译:多语言信息检索方法

摘要

A computer-implemented method for retrieving multi-lingual information in a server interacting with a collection of documents and a set of language-specific indices is disclosed. In at least one data-storage device a set of one or more language-specific indices are defined for a collection of documents with each index including stemmed and non-stemmed versions of terms contained in the documents; A query is received from a user, with the query associated with a set of one or more target languages. Using at least one processor the query is parsed into one or more terms with each term associated with a corresponding language identifier and a stemmed version of the term. Using at least one processor the non-stemmed and stemmed versions of each term are translated into each of the target languages to define respective sets of one or more equivalent query terms. Using at least one processor a set of documents are identified from the collection of documents for each of the target languages, with each set identified based on the equivalent query term for the corresponding target language.
机译:公开了一种用于在与文档的集合和一组特定于语言的索引进行交互的服务器中检索多语言信息的计算机实现的方法。在至少一个数据存储设备中,为一组文档定义了一组一个或多个特定于语言的索引,每个索引包括文档中包含的术语的词干和非词干版本;从用户接收查询,该查询与一组一种或多种目标语言相关联。使用至少一个处理器,将该查询解析为一个或多个术语,每个术语与相应的语言标识符和该术语的词干版本相关联。使用至少一个处理器,将每个词的非词干和词干版本转换为每种目标语言,以定义一个或多个等效查询词的相应集合。使用至少一个处理器,从针对每种目标语言的文档集合中识别出一组文档,其中基于对应的目标语言的等效查询词来识别每组文档。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号