Method and system of ranking and clustering for document indexing and retrieval

Abstract

A relevancy ranking and clustering method and system determines the relevance of a document to a user's query using a similarity comparison process. An ontological parser parses each input query into one or more query predicate structures, and parses a set of known documents into one or more document predicate structures. Each query predicate structure is compared with each document predicate structure to determine a matching degree, represented by a real number. A multilevel modifier strategy assigns different relevance values to the different parts of each predicate structure match when calculating that matching degree. The relevance of a document to a user's query is then determined by calculating a similarity coefficient based on the structures of each pair of query predicates and document predicates. Documents are autonomously clustered using a self-organizing neural network that provides a coordinate system for making judgments in a non-subjective fashion.
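The comparison step described in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the `PredicateStructure` fields, the weight values in `WEIGHTS`, the use of set overlap to compare arguments and modifiers, and the choice to average each query predicate's best match to obtain the similarity coefficient. The abstract states only that the parts of a match receive different relevance values and that the matching degree is a real number.

```python
from dataclasses import dataclass

# Hypothetical relevance values for the parts of a predicate structure match.
# The abstract gives no numbers; these are placeholders for illustration.
WEIGHTS = {"predicate": 0.5, "arguments": 0.3, "modifiers": 0.2}


@dataclass(frozen=True)
class PredicateStructure:
    """Assumed shape of a parsed predicate: a verb-like head plus its parts."""
    predicate: str
    arguments: tuple = ()
    modifiers: tuple = ()


def overlap(a, b):
    """Jaccard overlap of two item collections; 1.0 when both are empty."""
    sa, sb = set(a), set(b)
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def matching_degree(q, d):
    """Real-valued match between one query and one document predicate."""
    if q.predicate != d.predicate:
        return 0.0  # assumption: no credit unless the predicates agree
    return (
        WEIGHTS["predicate"]
        + WEIGHTS["arguments"] * overlap(q.arguments, d.arguments)
        + WEIGHTS["modifiers"] * overlap(q.modifiers, d.modifiers)
    )


def similarity_coefficient(query, document):
    """Compare every query predicate with every document predicate and
    average the best match found for each query predicate (an assumed
    aggregation; the abstract does not specify one)."""
    if not query or not document:
        return 0.0
    best = [max(matching_degree(q, d) for d in document) for q in query]
    return sum(best) / len(best)


def rank(query, documents):
    """Return document names sorted by descending similarity coefficient."""
    scores = {name: similarity_coefficient(query, preds)
              for name, preds in documents.items()}
    return sorted(scores, key=scores.get, reverse=True)
```

Under these assumptions, a document whose predicate structures share the query's predicate, arguments, and modifiers scores close to 1, while a document with no matching predicate scores 0. The clustering step with a self-organizing neural network is not covered by this sketch.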
