首页> 外国专利> Retrieving, detecting and identifying major and outlier clusters in a very large database

Retrieving, detecting and identifying major and outlier clusters in a very large database

机译:检索,检测和识别大型数据库中的主要和离群群集

摘要

The present invention discloses a method, a computer system, a computer readable medium and a sever. The method of the present invention comprises steps of; creating said document matrix from said documents using at least one attribute; creating a scaled residual matrix based on said document matrix using a predetermined function; executing singular value decomposition to obtain a basis vector corresponding to the largest singular value; re-constructing said residual matrix and scaling dynamically said re-constructed residual matrix to obtain another basis vector; repeating said singular value decomposition step to said re-constructing step to create a predetermined set of basis vector; and executing reduction of said document matrix to perform detection, retrieval and identification of said documents in said database.
机译:本发明公开了一种方法,计算机系统,计算机可读介质和服务器。本发明的方法包括以下步骤:使用至少一个属性从所述文档创建所述文档矩阵;使用预定函数基于所述文档矩阵创建缩放的残差矩阵;执行奇异值分解以获得对应于最大奇异值的基矢量;重建所述残差矩阵并动态缩放所述重建残差矩阵以获得另一基向量;将所述奇异值分解步骤重复到所述重构步骤以创建预定的基础矢量集合;执行所述文档矩阵的约简,以对所述数据库中的所述文档进行检测,检索和识别。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号