首页> 外国专利> Efficient similarity search and classification via rank aggregation

Efficient similarity search and classification via rank aggregation

机译:通过等级汇总进行有效的相似度搜索和分类

摘要

A system, method, and computer program product for automatically performing similarity search, classification, and other nearest-neighbor search-based applications using rank aggregation. The invention reduces the &egr;-approximate Euclidean nearest neighbor problem to the problem of finding the candidate with the best median rank in an election with n candidates and O(&egr;−2logn) voters. ;Database elements and a query are points projected in a multidimensional Euclidean space, and coordinates in the space serve as independent “voters” that rank database elements by their closeness to the query coordinate. The rankings are aggregated and the winners are the database elements with the highest aggregated ranks. ;Combined with dimensionality reduction, the invention is a simple, efficient, database-friendly scheme for generating a &egr;-approximate nearest neighbor answer. ;The invention also enables searching of categorical vs. mere numerical features by sorting the database according to each feature and aggregating the resulting rankings.
机译:一种系统,方法和计算机程序产品,用于使用等级汇总自动执行相似性搜索,分类和其他基于最近邻居搜索的应用程序。本发明将&egr-近似欧几里德最近邻问题简化为在具有n个候选者和O(&egr; − 2 logn)选民的选举中找到具有最佳中位秩的候选者的问题。 ;数据库元素和查询是投影在多维欧几里得空间中的点,并且空间中的坐标充当独立的“投票者”。根据数据库元素与查询坐标的接近程度对其进行排名。排名是汇总的,获胜者是汇总排名最高的数据库元素。与降维相结合,本发明是一种简单,有效,数据库友好的方案,用于产生近似的最接近的邻居答案。本发明还能够通过根据每个特征对数据库进行排序并汇总所得到的排名来搜索分类特征或单纯的数字特征。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号