首页> 外国专利> HAMMING SPACE-BASED APPROXIMATE QUERY METHOD AND STORAGE MEDIUM

HAMMING SPACE-BASED APPROXIMATE QUERY METHOD AND STORAGE MEDIUM

机译:基于汉明空间的近似查询方法和存储介质

摘要

A Hamming space-based approximate query method and a storage medium. The Hamming space-based approximate query method comprises the steps of: mapping all records and query data in an original database into hash binary vectors in a Hamming space to obtain a hash database; performing column reordering on binary data in the hash database; establishing an index structure for data newly generated after column reordering, the index structure comprising a histogram and an inverted hash index; and performing parsing and querying, and allocating a corresponding query threshold for each data segmentation. According to the method, the inclination of data can be well utilized, and threshold allocation is performed according to the inclination, so as to filter out a large amount of non-result data; and a histogram index structure and an inverted hash index structure are used, dimension reordering is performed according to different inclinations of data, and data columns having a large inclination are put together, so as to the utilize inclinations of the data more effectively, thereby improving approximate query efficiency.
机译:基于汉明空间的近似查询方法和存储介质。基于汉明的近似查询方法包括以下步骤:将原始数据库中的所有记录和查询数据映射到汉明空间中的哈希二进制向量中,以获得哈希数据库;在哈希数据库中执行关于二进制数据的列重新排序;建立新生成的数据重新排序新生成的数据的索引结构,该索引结构包括直方图和倒散列索引;并执行解析和查询,并为每个数据分段分配相应的查询阈值。根据该方法,可以很好地利用数据的倾斜,并且根据倾斜度执行阈值分配,以便过滤输出大量的非结果数据;使用直方图索引结构和倒散列索引结构,根据数据的不同倾斜度来执行维度重新排序,并且将具有大倾斜度的数据列放在一起,以便更有效地利用数据的倾斜度,从而改善近似查询效率。

著录项

  • 公开/公告号WO2021036070A1

    专利类型

  • 公开/公告日2021-03-04

    原文格式PDF

  • 申请/专利权人 SHENZHEN INSTITUTE OF COMPUTING SCIENCES;

    申请/专利号WO2019CN122454

  • 发明设计人 QIN JIANBIN;WANG YAOSHU;

    申请日2019-12-02

  • 分类号G06F16/22;

  • 国家 CN

  • 入库时间 2022-08-24 17:33:09

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号