首页> 外国专利> METHOD AND SYSTEM FOR RETRIEVING, DETECTING AND IDENTIFYING MAIN CLUSTER AND OUTLIER CLUSTER IN LARGE SCALE DATABASE, RECORDING MEDIUM AND SERVER

METHOD AND SYSTEM FOR RETRIEVING, DETECTING AND IDENTIFYING MAIN CLUSTER AND OUTLIER CLUSTER IN LARGE SCALE DATABASE, RECORDING MEDIUM AND SERVER

机译:在大型数据库,记录介质和服务器中检索,检测和识别主要集群和异常集群的方法和系统

摘要

PROBLEM TO BE SOLVED: To provide a method and a system for detecting, retrieving and identifying a main cluster and an outlier cluster in a large scale database, and to provide a recording medium and a server. SOLUTION: This method includes a step for generating a document matrix from a preceding document by using at least one attribute, a step for generating a residual matrix scaled on the basis of the document matrix from a prescribed function, a step for performing singular value decomposition to obtain a base vector corresponding to a maximum singular value, a step for reconstructing the residual matrix, dynamically scaling the reconstructed residual matrix and obtaining another base vector, a step for repeating from the singular value decomposition step to the reconstruction step to generate a set of prescribed base vectors, and a step for performing dimensional reduction of the document matrix and detecting, retrieving and identifying a document in a database.
机译:解决的问题:提供一种用于检测,检索和识别大型数据库中的主群集和离群群集的方法和系统,并提供记录介质和服务器。解决方案:该方法包括以下步骤:通过使用至少一个属性从先前的文档生成文档矩阵;从生成的函数生成基于文档矩阵缩放的残差矩阵的步骤;执行奇异值分解的步骤为了获得对应于最大奇异值的基本向量,用于重建残差矩阵,动态缩放所重建的残差矩阵并获得另一个基本向量的步骤,用于从奇异值分解步骤重复到重建步骤以生成集合的步骤规定的基本向量的集合,以及用于执行文档矩阵的降维并在数据库中检测,检索和标识文档的步骤。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号