Sampling Without Compromising Accuracy in Adaptive Data Analysis

Benjamin Fish; Lev Reyzin; Benjamin I. P. Rubinstein

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Sampling Without Compromising Accuracy in Adaptive Data Analysis

【24h】

Sampling Without Compromising Accuracy in Adaptive Data Analysis

机译：自适应数据分析中的采样而不会影响准确性

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work, we study how to use sampling to speed up mechanisms for answering adaptive queries into datasets without reducing the accuracy of those mechanisms. This is important to do when both the datasets and the number of queries asked are very large. In particular, we describe a mechanism that provides a polynomial speed-up per query over previous mechanisms, without needing to increase the total amount of data required to maintain the same generalization error as before. We prove that this speed-up holds for arbitrary statistical queries. We also provide an even faster method for achieving statistically-meaningful responses wherein the mechanism is only allowed to see a constant number of samples from the data per query. Finally, we show that our general results yield a simple, fast, and unified approach for adaptively optimizing convex and strongly convex functions over a dataset.

机译：在这项工作中，我们研究如何使用采样来加快将自适应查询回答到数据集中的机制，而又不降低这些机制的准确性。当数据集和所查询的数量都很大时，这样做很重要。特别是，我们描述了一种机制，该机制在以前的机制之上为每个查询提供了多项式加速，而无需增加保持与以前相同的泛化误差所需的数据总量。我们证明，这种加速适用于任意统计查询。我们还提供了一种更快的方法来实现具有统计意义的响应，其中仅允许该机制从每个查询的数据中看到恒定数量的样本。最后，我们证明了我们的一般结果产生了一种简单，快速且统一的方法，用于自适应优化数据集上的凸函数和强凸函数。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2020年第2期|共22页
作者
Benjamin Fish; Lev Reyzin; Benjamin I. P. Rubinstein;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
Adaptive data analysisdifferential privacysublinear-time algorithms;

机译：自适应数据分析差分隐私亚线性时间算法;

相似文献

外文文献
中文文献
专利

1. An adaptive fused sampling approach of high-accuracy data in the presence of low-accuracy data [J] . Gahrooei Mostafa Reisi, Paynabar Kamaran, Pacella Massimo, IIE Transactions . 2019,第11期

机译：存在低精度数据时的高精度数据自适应融合采样方法
2. An adaptive fused sampling approach of high-accuracy data in the presence of low-accuracy data [J] . Gahrooei Mostafa Reisi, Paynabar Kamaran, Pacella Massimo, IIE Transactions . 2019,第11期

机译：在低精度数据存在下高精度数据的自适应融合采样方法
3. A method for extracting plant roots from soil which facilitates rapid sample processing without compromising measurement accuracy. [J] . Metcalfe D B, Williams M, Aragao L E O C, The New Phytologist . 2007,第3期

机译：一种从土壤中提取植物根的方法，该方法有助于快速进行样品处理而不会影响测量精度。
4. App Review Analysis via Active Learning: Reducing Supervision Effort without Compromising Classification Accuracy [C] . Venkatesh T. Dhinakaran, Raseshwari Pulle, Nirav Ajmeri, IEEE International Requirements Engineering Conference . 2018

机译：通过主动学习应用审查分析：降低监督工作而不影响分类准确性
5. Cryptography for secure and private databases: Enabling practical data access without compromising privacy. [D] . Green, Matthew Daniel. 2009

机译：安全和私有数据库的密码术：在不损害隐私的情况下实现实际数据访问。
6. Protecting Privacy of Shared Epidemiologic Data without Compromising Analysis Potential [O] . John Cologne, Eric J. Grant, Eiji Nakashima, 2012

机译：在不影响分析潜力的情况下保护共享流行病学数据的隐私
7. Operating within Planetary Boundaries without compromising well-being? A Data Envelopment Analysis approach [O] . Michael Ehrenstein, Raúl Calvo-Serrano, Ángel Galán-Martín, 2020

机译：在行星边界内运作而不会妥协福祉？数据包络分析方法
8. Accuracy of Remotely Sensed Data: Sampling and Analysis Procedures [R] . Congalton, R. G. , Oderwald, R. G. , Mead, R. A. 1982

机译：远程感知数据的准确性：抽样和分析程序

Sampling Without Compromising Accuracy in Adaptive Data Analysis

摘要

著录项

相似文献

相关主题

期刊订阅