首页> 外文会议>SIGMOD/PODS 2007 >Lazy, Adaptive RID-List Intersection, and Its Application to Index Anding

【24h】

Lazy, Adaptive RID-List Intersection, and Its Application to Index Anding

机译：懒惰，自适应RID-List交叉点及其在索引Anding中的应用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

RID-List (row id list) intersection is a common strategy in query processing, used in star joins, column stores, and even search engines. To apply a conjunction of predicates on a table, a query processor does index lookups to form sorted RID-lists (or bitmap) of the rows matching each predicate, then intersects the RID-lists via an AND-tree, and finally fetches the corresponding rows to apply any residual predicates and aggregates. This process can be expensive when the RID-lists are large. Furthermore, the performance is sensitive to the order in which RIDlists are intersected together, and to treating the right predicates as residuals. If the optimizer chooses a wrong order or a wrong residual, due to a poor cardinality estimate, the resulting plan can run orders of magnitude slower than expected. We present a new algorithm for RID-list intersection that is both more efficient and more robust than this standard algorithm. First, we avoid forming the RID-lists up front, and instead form this lazily as part of the intersection. This reduces the associated IO and sort cost significantly, especially when the data distribution is skewed. It also ameliorates the problem of wrong residual table selection. Second, we do not intersect the RID-lists via an AND-tree, because this is vulnerable to cardinality mis-estimations. Instead, we use an adaptive set intersection algorithm that performs well even when the cardinality estimates are wrong. We present detailed experiments of this algorithm on data with varying distributions to validate its efficiency and predictability.

机译：RID-List（行ID列表）交集是查询处理中的常用策略，用于星型联接，列存储，甚至搜索引擎。为了在表上应用谓词的合取，查询处理器进行索引查找以形成与每个谓词匹配的行的排序RID列表（或位图），然后通过AND树与RID列表相交，最后获取对应的行以应用任何残留谓词和聚合。当RID列表很大时，此过程可能会很昂贵。此外，性能对于将RIDlist交叉在一起的顺序以及将正确的谓词视为残差都非常敏感。如果由于基数估计不正确，优化器选择了错误的顺序或错误的残差，则生成的计划的运行速度可能会比预期的慢几个数量级。我们提出了一种新的RID列表相交算法，它比该标准算法更有效，更健壮。首先，我们避免预先形成RID列表，而将其作为交集的一部分而懒惰地形成。这样可以显着降低相关的IO和排序成本，尤其是在数据分配不正确时。它还改善了错误的剩余表选择问题。其次，我们不通过AND树相交RID列表，因为它容易受到基数错误估计的影响。取而代之的是，我们使用自适应集合相交算法，即使基数估计错误，该算法也能很好地执行。我们目前对具有不同分布的数据进行该算法的详细实验，以验证其效率和可预测性。

著录项

来源
《SIGMOD/PODS 2007》|2007年|P.773-784|共12页
会议地点
作者
Vijayshankar Raman; Lin Qiao; Wei Han; Inderpal S. Narang; Ying-Lin Chen; Kou-Horng Yang; Fen-Lin Ling;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13;
关键词
AND tree; Intersection; Star Join; Lazy; Sorting;

机译：AND树;交叉点;星型加入;懒惰;排序;

相似文献

外文文献
中文文献
专利

1. Intersection Management via Vehicle Connectivity: The Intersection Cooperative Adaptive Cruise Control System Concept [J] . Zohdy Ismail H., Rakha Hesham A. ITS Journal . 2016,第1a6期

机译：通过车辆连通性进行路口管理：路口协作自适应巡航控制系统概念
2. Review on lazy learning regressors and their applications in QSAR. [J] . Kulkarni AJ, Jayaraman VK, Kulkarni BD Combinatorial chemistry & high throughput screening . 2009,第4期

机译：延迟学习回归器及其在QSAR中的应用综述。
3. Review on Lazy Learning Regressors and their Applications in QSAR [J] . Abhijit J. Kulkarni Valadi K. Jayaraman Bhaskar D. Kulkarni Combinatorial Chemistry & High Throughput Screening . 2009,第4期

机译：懒学习回归器及其在QSAR中的应用
4. Lazy, adaptive rid-list intersection, and its application to index anding [C] . Vijayshankar Raman, Lin Qiao, Wei Han, ACM SIGMOD international conference on Management of data . 2007

机译：惰性自适应列表列表交集及其在索引和运算中的应用
5. ALGEBRAIC TYPES IN LAZILY EVALUATED APPLICATIVE LANGUAGES. [D] . THATTE, SATISH R. 1982

机译：延迟评估的适用语言中的代数类型。
6. A Novel Automated Lazy Learning QSAR (ALL-QSAR) Approach: Method Development Applications and Virtual Screening of Chemical Databases Using Validated ALL-QSAR Models [O] . Shuxing Zhang, Alexander Golbraikh, Scott Oloff, -1

机译：一种新颖的自动延迟学习QSAR（ALL-QSAR）方法：使用经过验证的ALL-QSAR模型对化学数据库进行方法开发应用和虚拟筛选
7. Efficient Oblivious Pseudorandom Function with Applications to Adaptive OT and Secure Computation of Set Intersection [O] . Stanisław Jarecki, Xiaomin Liu 2009

机译：高效的伪伪随机函数函数应用于自适应OT和SET SET交叉点的安全计算

Lazy, Adaptive RID-List Intersection, and Its Application to Index Anding

摘要

著录项

相似文献

相关主题

期刊订阅