首页> 外国专利> MTS SKETCH FOR ACCURATE ESTIMATION OF SET-EXPRESSION CARDINALITIES FROM SMALL SAMPLES

MTS SKETCH FOR ACCURATE ESTIMATION OF SET-EXPRESSION CARDINALITIES FROM SMALL SAMPLES

机译:MTS草图,用于从小样本精确估计集合表达基数

摘要

A computer implemented method of estimating a cardinality of a stream, comprising: receiving a query for estimating a cardinality of a stream comprising a plurality of elements, obtaining a sample comprising a group of the plurality of elements randomly sampled from the respective stream, computing a first and second data structures for the sample used to compute an estimated sample cardinality of the sample and a ratio indicative of a proportion between the estimated sample cardinality and the estimated cardinality of the stream and computing the estimated cardinality of the stream by applying the ratio to the estimated sample cardinality. Where the first data structure comprises a plurality of maximal hash values computed for the sample using a plurality of hash functions and the second data structure comprises a fixed- size subset of the elements having a minimal hash value among the elements of the group.
机译:一种估计流的基数的计算机实现的方法,包括:接收用于估计包括多个元素的流的基数的查询,获得包括从各个流中随机采样的多个元素的组的样本,计算用于样本的第一和第二数据结构,用于计算样本的估计样本基数和表示估计样本基数与流的估计基数之间的比例的比率,并通过将比率应用于估计的样本基数。其中第一数据结构包括使用多个哈希函数为样本计算的多个最大哈希值,第二数据结构包括该组元素中具有最小哈希值的元素的固定大小子集。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号