首页> 外国专利> Method and system for fast similarity computation in high dimensional space

Method and system for fast similarity computation in high dimensional space

机译:高维空间中快速相似度计算的方法和系统

摘要

Method, system, and programs for computing similarity. Input data is first received from one or more data sources and then analyzed to obtain an input feature vector that characterizes the input data. An index is then generated based on the input feature vector and is used to archive the input data, where the value of the index is computed based on an improved Johnson-Lindenstrass transformation (FJLT) process. With the improved FJLT process, first, the sign of each feature in the input feature vector is randomly flipped to obtain a flipped vector. A Hadamard transformation is then applied to the flipped vector to obtain a transformed vector. An inner product between the transformed vector and a sparse vector is then computed to obtain a base vector, based on which the value of the index is determined.
机译:用于计算相似度的方法,系统和程序。首先从一个或多个数据源接收输入数据,然后对其进行分析以获得表征输入数据的输入特征向量。然后,基于输入特征向量生成索引,并将其用于存档输入数据,其中,基于改进的Johnson-Lindenstrass变换(FJLT)过程计算索引的值。通过改进的FJLT处理,首先,将输入特征向量中每个特征的符号随机翻转以获得翻转向量。然后将Hadamard变换应用于翻转后的矢量以获得变换后的矢量。然后,计算转换后的向量和稀疏向量之间的内积,以获得基向量,基于该基向量确定索引的值。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号