
Methods for encoder interface and data search by combinatorial signatures


Abstract

A database management system encodes information (such as the field values of a database record, or the words of a text document) so that the original information can be searched efficiently by a computer. An information object is encoded into a small "signature" or codeword. A base or "leaf" signature is computed by a known technique such as hashing. The logical intersection (AND) of every possible pair of bits of the base signature is computed, and each result is stored as one bit of a longer combinatorial signature. The bit-wise logical union (bit-OR) of the combinatorial signatures of a group of records produces a second-level combinatorial signature representing the particular field values present among those records. Higher-level combinatorial signatures are computed similarly. These combinatorial signatures avoid the "saturation" problem that occurs when signatures are grouped together, and the "combinatorial error" problem that falsely indicates the existence of non-existent records, thereby significantly improving the ability to reject data not relevant to a given query. When the combinatorial signatures are stored in a hierarchical data structure, such as the B-tree index of a database management system, they allow database records or document text to be searched more efficiently by eliminating large amounts of non-matching data from further consideration.
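The encoding steps described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names, the 8-bit base width, and the use of SHA-256 as the hash are all assumptions made for the example.

```python
import hashlib
from itertools import combinations

BASE_BITS = 8  # assumed base-signature width, chosen only for illustration


def base_signature(values, bits=BASE_BITS):
    """Hypothetical leaf signature: each value sets one bit chosen by hashing."""
    sig = 0
    for value in values:
        digest = hashlib.sha256(value.encode("utf-8")).digest()
        sig |= 1 << (int.from_bytes(digest[:4], "big") % bits)
    return sig


def combinatorial_signature(base, bits=BASE_BITS):
    """One output bit per pair (i, j) of base bits, set only if both bits are set."""
    comb = 0
    for k, (i, j) in enumerate(combinations(range(bits), 2)):
        if (base >> i) & 1 and (base >> j) & 1:
            comb |= 1 << k
    return comb


def group_signature(signatures):
    """Bit-wise OR of the signatures of a group of records (next level up)."""
    out = 0
    for sig in signatures:
        out |= sig
    return out


def may_contain(group_sig, query_sig):
    """True if every bit of the query signature is present in the group signature."""
    return (group_sig & query_sig) == query_sig


# Two records whose base signatures are given directly, to keep the demo deterministic.
record_a = 0b0011  # bits 0 and 1
record_b = 0b1100  # bits 2 and 3
query = 0b0101     # bits 0 and 2: no single record has both

# OR-ing plain base signatures cannot reject the query (a "combinatorial error").
plain_group = group_signature([record_a, record_b])
plain_hit = may_contain(plain_group, query)

# OR-ing combinatorial signatures keeps track of which bit pairs co-occurred.
comb_group = group_signature(
    [combinatorial_signature(record_a, 4), combinatorial_signature(record_b, 4)]
)
comb_hit = may_contain(comb_group, combinatorial_signature(query, 4))
```

In this sketch the plain group signature matches the query even though no record contains bits 0 and 2 together, while the combinatorial group signature rejects it because the pair (0, 2) never occurred in any single record. A base signature of n bits yields n(n-1)/2 pair bits, which is the longer signature the abstract refers to.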
