首页> 外文会议>Proceedings of the 31st symposium on principles of database systems >Approximating and Testing k-Histogram Distributions in Sub-linear Time
【24h】

Approximating and Testing k-Histogram Distributions in Sub-linear Time

机译:亚线性时间中k直方图分布的近似测试

获取原文
获取原文并翻译 | 示例

摘要

A discrete distribution p, over [n], is a k-histogram if its probability distribution function can be represented as a piece-wise constant function with k pieces. Such a function is represented by a list of k intervals and k corresponding values. We consider the following problem: given a collection of samples from a distribution p, find a k-histogram that (approximately) minimizes the σ2 distance to the distribution p. We give time and sample efficient algorithms for this problem. We further provide algorithms that, distinguish distributions that have the property of being a k-histogram from distributions that are e-far from any A-histogram in the σ1 distance and σ2 distance respectively.
机译:如果[n]上的离散分布p的概率分布函数可以表示为k个分段的分段常数函数,则它是k个直方图。这种功能由k个间隔和k个对应值的列表表示。我们考虑以下问题:给定一个分布p的样本集合,找到一个k直方图,该k直方图(近似)使距分布p的σ2距离最小。我们为该问题提供了时间和有效的示例算法。我们进一步提供了一种算法,该算法将具有k直方图特性的分布与分别位于σ1距离和σ2距离中的任何A直方图的e-far分布区分开。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号