Estimating most frequent values for a data set

Abstract

Provided are techniques for estimating most frequent values. A sample of values made up of rows is received from each of multiple nodes. The samples from the multiple nodes are aggregated to generate a sample table storing the rows. A descending list of most frequent values and their associated frequencies is obtained from the sample table. Values whose associated frequencies fall below a minimum absolute frequency are pruned from the descending list. The frequencies of the remaining most frequent values are extrapolated to reflect the data set.
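
The steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function name, the parameters (`total_rows`, `top_k`, `min_abs_frequency`), and in particular the linear scaling used for extrapolation are assumptions made for illustration, since the abstract does not say how extrapolation is performed.

```python
from collections import Counter
from typing import Hashable, Iterable, List, Sequence, Tuple


def estimate_most_frequent_values(
    node_samples: Iterable[Sequence[Hashable]],
    total_rows: int,
    top_k: int = 10,
    min_abs_frequency: int = 2,
) -> List[Tuple[Hashable, int]]:
    """Estimate the most frequent values of a data set from per-node samples.

    All names and defaults here are hypothetical, chosen for illustration.
    """
    # Aggregate the sample received from each node into one sample table.
    sample_table = [value for sample in node_samples for value in sample]
    if not sample_table:
        return []

    # Descending list of the most frequent values and their sample frequencies.
    descending = Counter(sample_table).most_common(top_k)

    # Prune values whose sample frequency is below the minimum absolute frequency.
    kept = [(value, freq) for value, freq in descending if freq >= min_abs_frequency]

    # Extrapolate to the full data set. ASSUMPTION: simple linear scaling by
    # total_rows / sample size; the abstract does not specify the method.
    scale = total_rows / len(sample_table)
    return [(value, round(freq * scale)) for value, freq in kept]


if __name__ == "__main__":
    samples = [["a", "a", "b"], ["a", "c", "b"], ["a", "d"]]
    print(estimate_most_frequent_values(samples, total_rows=800))
```

With the three node samples above (8 sampled rows standing in for an 800-row data set), the values seen only once are pruned and the call returns `[('a', 400), ('b', 200)]`.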

Bibliographic data

Publication number: US10176231B2

Publication date: 2019-01-08

Original format: PDF
Applicant/assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Application number: US201615017442
Inventors: JAMES L. FINNERTY; VENKATESH S. GOPAL; VENKANNABABU TAMMISETTI; PAUL-JOHN A. TO

Filing date: 2016-02-05
Classification: G06F17/30
Country: US
Record added to database: 2022-08-21 12:04:24
