首页> 外国专利> Partitioning log records based on term frequency and type for selective skipping during full-text searching

Partitioning log records based on term frequency and type for selective skipping during full-text searching

机译:根据术语频率和类型对日志记录进行分区,以便在全文搜索期间选择性跳过

摘要

A log record from a host machine node includes terms. Frequency of occurrence of the terms across a stream of log records is determined. Based on the frequency satisfying a threshold, a Bloom filter vector is selected from among a plurality of Bloom filter vectors based on the frequency, the Bloom filter vector is updated based on the terms, and an identifier for the log record is stored with an association to the Bloom filter vector. In contrast, based on the frequency of occurrence not satisfying the defined frequency range, a type identifier is identified based on the terms, a Bloom filter vector is selected from among the plurality of Bloom filter vectors based on the type identifier, the Bloom filter vector is updated based on the terms, and an identifier for the log record is stored with an association to the Bloom filter vector.
机译:来自主机节点的日志记录包含术语。确定术语在整个日志记录流中出现的频率。基于满足阈值的频率,基于该频率从多个布隆过滤器向量中选择布隆过滤器向量,基于这些术语更新布隆过滤器向量,并且将日志记录的标识符与关联存储在一起。到Bloom过滤器矢量。相反,基于不满足定义的频率范围的出现频率,基于术语来识别类型标识符,基于类型标识符,布隆过滤器矢量从多个布隆过滤器矢量中选择布隆过滤器矢量。会根据这些条款更新,并存储日志记录的标识符以及与Bloom过滤器向量的关联。

著录项

  • 公开/公告号US9892166B2

    专利类型

  • 公开/公告日2018-02-13

    原文格式PDF

  • 申请/专利权人 CA INC.;

    申请/专利号US201414510401

  • 发明设计人 SREENIVAS GUKAL;

    申请日2014-10-09

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 12:57:55

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号