首页> 外文会议>Performance evaluation and benchmarking for the era of artificial Intelligence >Benchmarking and Performance Analysis of Event Sequence Queries on Relational Database
【24h】

Benchmarking and Performance Analysis of Event Sequence Queries on Relational Database

机译:关系数据库上事件序列查询的基准测试和性能分析

获取原文
获取原文并翻译 | 示例

摘要

The relational database has been the fundamental technology for data-driven decision making based on the histories of event occurrences about the analysis target. Thus the performance of analytical workloads in relational databases has been studied intensively. As a common language for performance analysis, decision support benchmarks such as TPC-H have been widely used. These benchmarks focus on summarization of the event occurrence information. Individual event occurrences or inter-occurrence associations are rarely examined in these benchmarks. However, this type of query, called an event sequence query in this paper, is becoming important in various real-world applications. Typically, an event sequence query extracts event sequences starting from a small number of interesting event occurrences. In a relational database, these queries are described by multiple self-joins on the whole sequence of events. Furthermore, each pair of events to be joined tends to have a strong correlation in the timestamp attribute, resulting in heavily skewed join workloads. Despite the usefulness in real-world data analysis, very little work has been done on performance analysis of event sequence queries. In this paper, we present the initial design of ESQUE benchmark, a benchmark for event sequence queries. We then give experimental results of the comparison of database system implementations: PostgreSQL v.s. MySQL, and the comparison of historical versions of PostgreSQL. Conducted performance analysis shows that ESQUE benchmark allows us to discover performance problems which had been overlooked in existing benchmarks.
机译:关系数据库已经成为基于有关分析目标的事件发生历史来进行数据驱动决策的基础技术。因此,对关系数据库中分析工作负载的性能进行了深入研究。作为性能分析的通用语言,决策支持基准(例如TPC-H)已被广泛使用。这些基准测试集中于事件发生信息的汇总。在这些基准测试中,很少检查单个事件的发生或发生的关联。但是,这种查询在本文中称为事件序列查询,在各种实际应用中变得越来越重要。通常,事件序列查询从少量有趣的事件发生中提取事件序列。在关系数据库中,这些查询是通过整个事件序列上的多个自联接描述的。此外,要加入的每对事件在timestamp属性中往往具有很强的相关性,从而导致大量的加入工作负载出现偏差。尽管在现实世界的数据分析中很有用,但是在事件序列查询的性能分析上所做的工作很少。在本文中,我们介绍了ESQUE基准测试的初始设计,该标准是事件序列查询的基准。然后,我们给出了数据库系统实现比较的实验结果:PostgreSQL v.s. MySQL,以及PostgreSQL历史版本的比较。进行的性能分析表明,ESQUE基准使我们能够发现在现有基准中被忽略的性能问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号