首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Sampling Without Compromising Accuracy in Adaptive Data Analysis
【24h】

Sampling Without Compromising Accuracy in Adaptive Data Analysis

机译:自适应数据分析中的采样而不会影响准确性

获取原文
           

摘要

In this work, we study how to use sampling to speed up mechanisms for answering adaptive queries into datasets without reducing the accuracy of those mechanisms. This is important to do when both the datasets and the number of queries asked are very large. In particular, we describe a mechanism that provides a polynomial speed-up per query over previous mechanisms, without needing to increase the total amount of data required to maintain the same generalization error as before. We prove that this speed-up holds for arbitrary statistical queries. We also provide an even faster method for achieving statistically-meaningful responses wherein the mechanism is only allowed to see a constant number of samples from the data per query. Finally, we show that our general results yield a simple, fast, and unified approach for adaptively optimizing convex and strongly convex functions over a dataset.
机译:在这项工作中,我们研究如何使用采样来加快将自适应查询回答到数据集中的机制,而又不降低这些机制的准确性。当数据集和所查询的数量都很大时,这样做很重要。特别是,我们描述了一种机制,该机制在以前的机制之上为每个查询提供了多项式加速,而无需增加保持与以前相同的泛化误差所需的数据总量。我们证明,这种加速适用于任意统计查询。我们还提供了一种更快的方法来实现具有统计意义的响应,其中仅允许该机制从每个查询的数据中看到恒定数量的样本。最后,我们证明了我们的一般结果产生了一种简单,快速且统一的方法,用于自适应优化数据集上的凸函数和强凸函数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号