首页> 外文会议>TPC Technology Conference on Performance Evaluation and Benchmarking >Experiences and Lessons in Practice Using TPCx-BB Benchmarks
【24h】

Experiences and Lessons in Practice Using TPCx-BB Benchmarks

机译:使用TPCX-BB基准测试在实践中的经验和课程

获取原文

摘要

The TPCx-BigBench (TPCx-BB) is a TPC Express benchmark, which is designed to measure the performance of big data analytics systems. It contains 30 use cases that simulate big data processing, big data storage, big data analytics, and reporting. We have used this benchmark to evaluate the performance of software and hardware components for big data systems. It has very good coverage on different data types and provides enough scalability to address data size and node scaling problems. We have gained lots of meaningful insights through this benchmark to design analytic systems. In the meantime, we also found we cannot merely rely on TPCx-BB to evaluate and design an end-to-end big data systems. There are some gaps between an analytics system and a real end-to-end system. The whole data flow of a real end-to-end system should include data ingestion, which moves data from where it is originated into a system where it can be stored and analyzed such as Hadoop. Data ingestion may be challenging for businesses at a reasonable speed in order to maintain a competitive advantage. However, TPCx-BB cannot help on performance evaluation of software and hardware for data ingestion. Big data is composed of three dimensions: Volume, Variety, and Velocity. The Velocity refers to the high speed in data processing: real-time or near real-time. With big data technology widely used, real-time and near real-time processing become more popular. There is very strict limitation on bandwidth and latency for real-time processing. TPCx-BB cannot help on performance evaluation of software and hardware for real-time processing. This paper mainly discusses these experiences and lessons in practice using TPCx-BB. Then, we provide some advices to extend TPCx-BB to cover data ingestion and real-time processing. We also share some ideas how to implement TPCx-BB coverage.
机译:TPCX-BigBench(TPCX-BB)是TPC Express基准,旨在衡量大数据分析系统的性能。它包含了30个用例,用于模拟大数据处理,大数据存储,大数据分析和报告。我们使用该基准测试来评估大数据系统的软件和硬件组件的性能。它对不同的数据类型具有很好的覆盖范围,并提供足够的可扩展性来解决数据大小和节点缩放问题。通过这款基准测试来设计分析系统,我们获得了许多有意义的见解。与此同时,我们也发现我们不能仅仅依靠TPCX-BB来评估和设计端到端的大数据系统。分析系统和真实端到端系统之间存在一些间隙。实体端到端系统的整个数据流应包括数据摄取,其将数据从其源自何处存储和分析如hadoop的系统。以合理的速度,数据摄取可能对企业充满挑战,以保持竞争优势。但是,TPCX-BB无法帮助对数据摄取的软件和硬件进行性能评估。大数据由三个维度组成:体积,品种和速度。速度是指数据处理中的高速:实时或近实时。随着大数据技术广泛使用,实时和近实时处理变得更加流行。对实时处理的带宽和延迟非常严格的限制。 TPCX-BB无法帮助对软件和硬件进行实时处理的性能评估。本文主要讨论了使用TPCX-BB实践中的这些经验和课程。然后,我们提供一些建议来扩展TPCX-BB以涵盖数据摄取和实时处理。我们还分享了一些想法如何实现TPCX-BB覆盖范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号