RANDOM DRAW FOREST INDEX STRUCTURE FOR SEARCHING LARGE SCALE UNSTRUCTURED DATA

Abstract

A system and method for generating an index structure for indexing a plurality of unstructured data objects, including: generating a set of compact feature vectors, the set including a compact feature vector for each of the data objects, the compact feature vector for each data object including a sequence of hashed values that represent the data object; generating a plurality of twisted compact feature vector sets from the set of compact feature vectors, each of the twisted compact feature vector sets being generated by applying a respective random shuffling permutation to the set of compact feature vectors; and, for each twisted compact feature vector set, generating an index for the data objects in which the data objects are slotted based on sequences of hashed values in the twisted compact feature vector set.
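
The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several details are assumptions the abstract does not specify: the hashing scheme (sign of a random projection, one bit per hashed value), the slotting rule (objects are grouped by the first `prefix_len` values of their twisted sequence, in a flat dictionary rather than a tree), and every name used here (`compact_vectors`, `build_indexes`, `candidates`, `n_bits`, `n_indexes`, `prefix_len`).

```python
import random
from collections import defaultdict


def compact_vectors(features, n_bits, rng):
    """Hash each raw feature vector into a sequence of n_bits binary values.

    Assumption: sign-of-random-projection hashing. The abstract only says
    "a sequence of hashed values that represent the data object".
    """
    dim = len(features[0])
    planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]
    return [
        tuple(int(sum(p * x for p, x in zip(plane, vec)) >= 0) for plane in planes)
        for vec in features
    ]


def build_indexes(compact, n_indexes, prefix_len, rng):
    """Build one index per random shuffling permutation.

    Each permutation reorders the positions of every compact vector, giving
    one "twisted" set. Assumption: objects are slotted by the first
    prefix_len values of their twisted sequence.
    """
    n_bits = len(compact[0])
    indexes = []
    for _ in range(n_indexes):
        perm = list(range(n_bits))
        rng.shuffle(perm)  # the random shuffling permutation for this index
        buckets = defaultdict(list)
        for obj_id, vec in enumerate(compact):
            twisted = tuple(vec[i] for i in perm)
            buckets[twisted[:prefix_len]].append(obj_id)
        indexes.append((perm, dict(buckets)))
    return indexes


def candidates(query_compact, indexes, prefix_len):
    """Collect ids of objects sharing a slot with the query in any index."""
    found = set()
    for perm, buckets in indexes:
        twisted = tuple(query_compact[i] for i in perm)
        found.update(buckets.get(twisted[:prefix_len], []))
    return found


rng = random.Random(0)
features = [[rng.uniform(-1, 1) for _ in range(8)] for _ in range(50)]
compact = compact_vectors(features, n_bits=16, rng=rng)
indexes = build_indexes(compact, n_indexes=4, prefix_len=6, rng=rng)
print(len(candidates(compact[0], indexes, prefix_len=6)))
```

Using several permutations means two objects whose hashed sequences differ in only a few positions are likely to share a slot in at least one index, even when those positions happen to fall inside the prefix of another.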