Introducing Skew into the TPC-H Benchmark

机译：在TPC-H基准测试中引入偏斜

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

While uniform data distributions were a design choice for the TPC-D benchmark and its successor TPC-H, it has been universally recognized that data skew is prevalent in data warehousing. A modern benchmark should therefore provide a test bed to evaluate the ability of database engines to handle skew. This paper introduces a concrete and practical way to introduce skew in the TPC-H data model by modifying the customer and supplier tables to reflect non-uniform customer and supplier populations. The first proposal consists in defining customer and supplier populations by nation that are roughly proportional to the actual nation populations. In a second proposal, nations are divided into two groups, one with large and equal populations and the other with equal and small populations. We then experiment with the proposed skew models to show how the optimizer of a parallel system can recognize skew and potentially produce different plans depending on the presence of skew. A comparison is made between query performance with the proposed method vs. the original uniform TPC-H distributions. Finally, an approach is presented to introduce skew into TPC-H with the current query set that is compatible with the current benchmark specification rules and could be implemented today.

机译：虽然统一的数据分发是TPC-D基准测试及其后继TPC-H的设计选择，但人们普遍认为数据偏斜在数据仓库中很普遍。因此，现代基准测试应该提供一个测试平台，以评估数据库引擎处理偏斜的能力。本文通过修改客户和供应商表以反映不均匀的客户和供应商数量，介绍了一种在TPC-H数据模型中引入偏斜的具体可行方法。第一个建议是按国家定义与实际国家人口大致成比例的客户和供应商人口。在第二个提案中，国家分为两组，一组人口大而平等，而另一组人口小而平等。然后，我们对提出的偏斜模型进行实验，以显示并行系统的优化程序如何识别偏斜并根据偏斜的存在可能产生不同的计划。在提出的方法的查询性能与原始均匀TPC-H分布之间进行了比较。最后，提出了一种方法，该方法将具有当前查询集的TPC-H中的时滞引入到当前的基准规范规则中，并且可以在今天实现。

著录项

来源
《Topics in performance evaluation, measurement and characterization》|2011年|137-145|共9页
会议地点 Seattle WA(US)
作者
Alain Crolotte; Ahmad Ghazal;
展开▼
作者单位

Teradata Corporation, 100 N. Sepulveda Blvd. El Segundo, Ca. 90245;

Teradata Corporation, 100 N. Sepulveda Blvd. El Segundo, Ca. 90245;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Benchmarking partner selection: introducing the AHP method in the benchmarking process to define best practice partners [J] . J. Razmi, S.F. Ghaderi, P.K. Ahmed International Journal of Management Practice . 2005,第3期

机译：基准测试合作伙伴选择：在基准测试过程中引入AHP方法以定义最佳实践合作伙伴
2. FREE VIBRATION OF SKEW LAMINATES - A BRIEF REVIEW AND SOME BENCHMARK RESULTS [J] . S Haldar, S Pal, K Kalita The Transactions of the Royal Institution of Naval Architects,Part A:International journal of maritime engineering . 2019,第Pta4期

机译：偏斜层压板的自由振动 - 简要评论和一些基准结果
3. High-frequency asymptotic solutions benchmarking skew incidence diffraction by anisotropic impedance half and full planes [J] . Paolo Nepa, Giuliano Manara, Andreina Armogida, Radio Science . 2007,第6期

机译：高频渐近解，通过各向异性阻抗半平面和全平面对偏斜入射衍射进行标定
4. Introducing Skew into the TPC-H Benchmark [C] . Alain Crolotte, Ahmad Ghazal Technology Conference on Performance Evaluation and Benchmarking . 2012

机译：引入TPC-H基准
5. A decomposition algorithm of skew-symmetric and skew-symmetrizable exchange matrices. [D] . Gu, Weiwen. 2012

机译：斜对称和不可对称交换矩阵的分解算法。
6. The associations between work–life balance behaviours teamwork climate and safety climate: cross-sectional survey introducing the work–life climate scale psychometric properties benchmarking data and future directions [O] . J Bryan Sexton, Stephanie P Schwartz, Whitney A Chadwick, -1

机译：工作与生活平衡的行为团队合作气氛与安全气氛之间的关联：横断面调查介绍了工作与生活的气候规模心理计量特性基准数据和未来方向
7. TPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark [O] . Boncz, Peter, Neumann, Thomas, Erling, Orri, 2013

机译：TPC-H分析：从有影响力的基准中学到的隐性消息和教训

Introducing Skew into the TPC-H Benchmark

摘要

著录项

相似文献

相关主题

期刊订阅