首页> 外国专利> Estimating most frequent values for a data set

Estimating most frequent values for a data set

机译:估计数据集的最频繁值

摘要

Provided are techniques for estimating most frequent values. A sample of values made up of rows is received from each of multiple nodes. The sample of values from each of the multiple nodes are aggregated to generate a sample table storing the rows. A descending list of most frequent values and associated frequencies is obtained using the sample table. Most frequent values are pruned from the descending list whose associated frequencies are below a minimum absolute frequency. The remaining most frequent values are extrapolated to reflect a data set.
机译:提供了用于估计最频繁值的技术。从多个节点中的每个节点接收由行组成的值的样本。来自多个节点中的每个节点的值样本被汇总以生成存储行的样本表。使用样本表可获得最频繁值和相关频率的降序列表。最频繁的值从其相关频率低于最小绝对频率的降序列表中删除。剩余的最频繁值被外推以反映数据集。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号