Conference: TPC Technology Conference on Performance Evaluation and Benchmarking

Benchmarking and Performance Analysis of Event Sequence Queries on Relational Database

Abstract

Relational databases have been the fundamental technology for data-driven decision making based on the history of event occurrences concerning the analysis target. The performance of analytical workloads in relational databases has therefore been studied intensively. Decision support benchmarks such as TPC-H have been widely used as a common language for performance analysis. These benchmarks focus on summarizing event occurrence information; individual event occurrences and associations between occurrences are rarely examined. However, queries of this latter type, called event sequence queries in this paper, are becoming important in various real-world applications. Typically, an event sequence query extracts event sequences starting from a small number of interesting event occurrences. In a relational database, such queries are expressed as multiple self-joins over the whole sequence of events. Furthermore, each pair of events to be joined tends to be strongly correlated in the timestamp attribute, which results in heavily skewed join workloads. Despite the usefulness of event sequence queries in real-world data analysis, very little work has been done on analyzing their performance. In this paper, we present the initial design of the ESQUE benchmark, a benchmark for event sequence queries. We then give experimental results comparing database system implementations (PostgreSQL vs. MySQL) and comparing historical versions of PostgreSQL. The performance analysis shows that the ESQUE benchmark allows us to discover performance problems that had been overlooked in existing benchmarks.
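To make the phrase "multiple self-joins over the whole sequence of events" concrete, the following is a minimal sketch of what an event sequence query might look like. It is not taken from the ESQUE benchmark: the `events` table, its columns (`entity_id`, `event_type`, `ts`), the event type names, the `find_sequences` helper, and the time-window condition are all assumptions made for illustration. SQLite is used here only because it ships with Python; the paper itself evaluates PostgreSQL and MySQL.

```python
import sqlite3


def find_sequences(rows, window):
    """Return (entity_id, ts1, ts2, ts3) for login -> view -> purchase chains.

    Hypothetical schema, not the ESQUE schema: each row is
    (entity_id, event_type, ts). Each step must occur strictly after the
    previous one and no more than `window` time units later.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE events (entity_id INTEGER, event_type TEXT, ts INTEGER)"
    )
    conn.executemany("INSERT INTO events VALUES (?, ?, ?)", rows)

    # The events table is joined with itself twice. The join conditions on ts
    # are what correlate the joined pairs in the timestamp attribute.
    query = """
        SELECT e1.entity_id, e1.ts, e2.ts, e3.ts
        FROM events AS e1
        JOIN events AS e2
          ON e2.entity_id = e1.entity_id
         AND e2.ts > e1.ts AND e2.ts <= e1.ts + :window
        JOIN events AS e3
          ON e3.entity_id = e2.entity_id
         AND e3.ts > e2.ts AND e3.ts <= e2.ts + :window
        WHERE e1.event_type = 'login'
          AND e2.event_type = 'view'
          AND e3.event_type = 'purchase'
        ORDER BY e1.entity_id, e1.ts, e2.ts, e3.ts
    """
    result = conn.execute(query, {"window": window}).fetchall()
    conn.close()
    return result


# Made-up sample data: only entity 1 completes the chain within a short window.
SAMPLE = [
    (1, "login", 10), (1, "view", 12), (1, "purchase", 15),
    (2, "login", 10), (2, "view", 50), (2, "purchase", 52),
    (3, "view", 5), (3, "login", 6), (3, "purchase", 7),
]

if __name__ == "__main__":
    print(find_sequences(SAMPLE, window=10))
```

In this sketch the filter on `e1.event_type` plays the role of the "small number of interesting event occurrences" that start each sequence, while the timestamp range conditions restrict each later join to events close in time to the previous one.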
