On Random Sampling over Joins

机译：在加入的随机抽样上

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A major bottleneck in implementing sampling as a primitive relational operation is the inefficiency of sampling the output of a query. It is not even known whether it is possible to generate a sample of a join tree without first evaluating the join tree completely. We undertake a detailed study of this problem and attempt to analyze it in a variety of settings. We present theoretical results explaining the difficulty of this problem and setting limits on the efficiency that can be achieved. Based on new insights into the interaction between join and sampling, we develop join sampling techniques for the settings where our negative results do not apply. Our new sampling algorithms are significantly more efficient than those known earlier. We present experimental evaluation of our techniques on Microsoft's SQL Server 7.0.

机译：实施采样作为原始关系操作的主要瓶颈是对查询输出进行采样的效率。甚至不知道是否可以生成连接树的样本，而无需完全评估连接树。我们对此问题进行了详细研究，并试图在各种环境中分析它。我们提出了理论结果，解释了这个问题的难度和确定可以实现的效率的限制。基于新的见解进入加入和采样之间的交互，我们开发加入采样技术，了解我们的负面结果不适用的设置。我们的新采样算法明显比前面已知的效率更高。我们对Microsoft的SQL Server 7.0提供了我们技术的实验评估。

著录项

来源
《ACM SIGMOD International Conference on Management of Data》|1999年||共12页
会议地点
作者
Surajit Chaudhuri; Rajeev Motwani; Vivek Narasayya;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-532;
关键词

相似文献

外文文献
中文文献
专利

1. Randomly selected order statistics in ranked set sampling: A less expensive comparable alternative to simple random sampling [J] . Amiri Saeid, Jozani Mohammad Jafari, Modarres Reza Environmental and ecological statistics . 2018,第2期

机译：排名集采样中随机选择的订单统计：简单随机采样的较便宜的可比较替代方案
2. Exploiting collider bias to apply two-sample summary data Mendelian randomization methods to one-sample individual level data [J] . Ciarrah Barry, Junxi Liu, Rebecca Richmond, PLoS Genetics . 2021,第8期

机译：利用碰撞者偏见将两个样本摘要数据门铃随机化方法应用于一个样本单个级别数据
3. Effect of Umbilical Cord Blood Sampling versus Admission Blood Sampling on Requirement of Blood Transfusion in Extremely Preterm Infants: A Randomized Controlled Trial [J] . Balasubramanian Haribalakrishna, Malpani Priyanka, Sindhur Mythily, The Journal of pediatrics . 2019,第1期

机译：脐带血样与入院血液取样对极端早产中输血要求的影响：随机对照试验
4. Random Sampling over Streaming Window Joins [C] . Jiadong Ren, Wanchang Jiang, Cong Huo International Symposium on Data, Privacy, and E-Commerce . 2007

机译：随机抽样在流窗口加入
5. Randomization-based inference about latent variables from complex samples: The case of two-stage sampling. [D] . Li, Tiandong. 2012

机译：基于复杂样本的潜在变量的基于随机化的推断：两阶段采样的情况。
6. Random access with a distributed Bitmap Join Index for Star Joins [O] . Jaqueline J. Brito, Thiago Mosqueiro, Ricardo R. Ciferri, 2020

机译：使用星型联接的分布式位图联接索引进行随机访问
7. On Random Sampling over Joins [O] . Surajit Chaudhuri et al. 2008

机译：关于连接的随机抽样
8. Acceptance Sampling using Judgmental and Randomly Selected Samples [R] . Sego, L. H., Shulman, S. A., Anderson, K. K., 2010

机译：使用判断和随机选择的样本进行验收抽样

On Random Sampling over Joins

摘要

著录项

相似文献

相关主题

期刊订阅