European Semantic Web Conference

KOBE: Cloud-Native Open Benchmarking Engine for Federated Query Processors

Abstract

In the SPARQL query processing community, as well as in the wider database community, benchmark reproducibility is based on releasing datasets and query workloads. However, this paradigm breaks down for federated query processors, as these systems do not manage the data they serve to their clients but provide a data-integration abstraction over the actual query processors that are in direct contact with the data. As a consequence, benchmark results can be greatly affected by the performance and characteristics of the underlying data services. This is further aggravated when one considers benchmarking under more realistic conditions, where internet latency and throughput between the federator and the federated data sources are also key factors. In this paper we present KOBE, a benchmarking system that leverages modern containerization and cloud computing technologies in order to reproduce collections of data sources. In KOBE, data sources are formally described in more detail than is conventionally provided, covering not only the data served but also the specific software that serves it and its configuration, as well as the characteristics of the network that connects them. KOBE provides a specification formalism and a command-line interface that completely hide from the user the mechanics of provisioning and orchestrating the benchmarking process on Kubernetes-based infrastructures, and of simulating network latency. Finally, KOBE automates the process of collecting and interpreting logs, and of extracting and visualizing evaluation metrics from these logs.
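To make the idea of a richer data-source description more concrete, the following is a minimal sketch in Python. It is not KOBE's specification format or API: the class `DataSourceSpec`, its fields, the helper functions, and the log line layout `query=<id> ms=<float>` are all hypothetical names chosen for illustration. The sketch only shows how a description that records the dataset, the serving software, its configuration, and a simulated network delay could feed a simple per-query metric extracted from logs.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class DataSourceSpec:
    """Hypothetical description of one federated data source (not KOBE's schema)."""
    name: str
    dataset: str              # which data is served
    server_software: str      # which software serves it
    server_config: dict = field(default_factory=dict)  # how that software is configured
    latency_ms: float = 0.0   # assumed one-way simulated delay to the federator


def estimated_network_overhead_ms(sources, requests_per_source):
    """Rough added delay, assuming one round trip per request and sequential requests."""
    return sum(2 * s.latency_ms * requests_per_source[s.name] for s in sources)


def parse_query_times(log_lines):
    """Average the durations found in hypothetical lines of the form 'query=<id> ms=<float>'."""
    times = {}
    for line in log_lines:
        fields = dict(part.split("=", 1) for part in line.split())
        times.setdefault(fields["query"], []).append(float(fields["ms"]))
    return {query: mean(values) for query, values in times.items()}
```

Under these assumptions, two sources with delays of 10 ms and 25 ms that receive 3 and 2 requests respectively would add an estimated 160 ms of network overhead, which illustrates why the network characteristics belong in the benchmark description alongside the data.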
