Finding Heavy Hitters from Lossy or Noisy Data


Abstract

Motivated by Dvir et al. and Wigderson and Yehudayoff [3,10], we examine the problem of discovering the set of heavy hitters of a distribution on strings (i.e., the set of strings with at least a certain minimum probability) from lossy or noisy samples. While previous work concentrated on finding both the set of most probable elements and their probabilities, we consider enumeration: the problem of finding just a list that includes all the most probable elements, without their associated probabilities. Unlike Wigderson and Yehudayoff [10], we do not assume the underlying distribution has small support size, and our time bounds are independent of the support size. For the enumeration problem, we give a polynomial-time algorithm for the lossy sample model for any constant erasure probability μ < 1, and a quasi-polynomial-time algorithm for the noisy sample model for any bit-flip noise probability ν < 1/2. We extend the lower bound on the number of samples required for the reconstruction problem from [3] to the enumeration problem, showing that when μ = 1 - o(1), no polynomial-time algorithm exists.
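
The two sample models named in the abstract can be illustrated with a small simulation. This is only a sketch of the usual reading of those models (each coordinate independently erased, or each bit independently flipped); it is not the enumeration algorithm from the paper. The function names, the `?` erasure symbol, and the dictionary representation of the distribution are assumptions made for illustration.

```python
import random


def draw(dist, rng):
    """Draw one string from a distribution given as {string: probability}."""
    strings, probs = zip(*dist.items())
    return rng.choices(strings, weights=probs, k=1)[0]


def lossy_sample(x, mu, rng):
    """Lossy model (assumed form): each coordinate is independently
    replaced by '?' with erasure probability mu."""
    return "".join("?" if rng.random() < mu else c for c in x)


def noisy_sample(x, nu, rng):
    """Noisy model (assumed form): each bit is independently flipped
    with noise probability nu."""
    flip = {"0": "1", "1": "0"}
    return "".join(flip[c] if rng.random() < nu else c for c in x)


if __name__ == "__main__":
    rng = random.Random(2013)
    # Hypothetical distribution on 4-bit strings; "0110" is the heavy hitter.
    dist = {"0110": 0.6, "1010": 0.3, "1111": 0.1}
    for _ in range(3):
        x = draw(dist, rng)
        print(lossy_sample(x, 0.5, rng), noisy_sample(x, 0.25, rng))
```

An enumeration algorithm in this setting would see only the outputs of `lossy_sample` or `noisy_sample`, never the underlying strings, and would have to return a list containing every string whose probability exceeds the chosen threshold.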


Bibliographic details

Source
Approximation, Randomization, and Combinatorial Optimization: Algorithms and Techniques | 2013 | pp. 347-362 | 16 pages
Conference location: Berkeley, CA (US)
Authors
Lucia Batman; Russell Impagliazzo; Cody Murray; Ramamohan Paturi
Author affiliations

Department of Computer Science and Engineering, University of California, San Diego (all four authors)

Original format: PDF
Language: English
Keywords
Population Recovery; Enumeration of Heavy Hitters; Learning Discrete Distributions;


Similar literature

1. Probabilistic Lossy Counting: An efficient algorithm for finding heavy hitters [J]. Xenofontas Dimitropoulos, Paul Hurley, Andreas Kind. Computer Communication Review. 2008, No. 1
2. Public Review for Probabilistic Lossy Counting: An efficient algorithm for finding heavy hitters [J]. Pablo Rodriguez. Computer Communication Review. 2008, No. 1
3. Finding Cardinality Heavy-Hitters in Massive Traffic Data and Its Application to Anomaly Detection [J]. Keisuke ISHIBASHI, Tatsuya MORI, Ryoichi KAWAHARA, IEICE Transactions on Communications. 2008, No. 5
4. Finding Heavy Hitters from Lossy or Noisy Data [C]. Lucia Batman, Russell Impagliazzo, Cody Murray, International Workshop on Randomization and Computation. 2013
5. Progressive source-channel coding for multimedia transmission over noisy and lossy channels with and without feedback [D]. Chande, Vinay. 2004
6. Hit series selection in noisy HTS data: clustering techniques, statistical tests and data visualisations [O]. Christoph Müller, Daniel Ormsby, Isabella Feierberg, 2014
7. Finding Subcube Heavy Hitters in Data Streams [O]. Kveton, Branislav, Muthukrishnan, S., Vu, Hoa T. 2017
