【24h】

Accelerating Big Data Applications on Tiered Storage System with Various Eviction Policies

机译:运用各种驱逐策略加速分层存储系统上的大数据应用

获取原文
获取原文并翻译 | 示例

摘要

Utilizing new type devices, such as SSD, to improve I/O performance of hybrid storage has become a tendency recently. Many efforts are made to apply the new type devices to hybrid storage in distributed environment, but most of them are confined to the specific file systems, such as HDFS. Besides, the low performance of HDFS descends the performance of hybrid storage. In this paper, we improve the performance of tiered storage system (one kind of hybrid storage system) in distributed environment with a plughable eviction framework considering that the data on each node is regularly accessed. On top of the eviction framework, we provide a couple of eviction policies, including LRU, LRFU, LIRS and ARC, covering different access patterns to accelerate the upper big data applications. Moreover, our design is general for all tiered storage systems. Then we evaluate the performance of our eviction framework through three widely-used big data applications and discover that LIRS can improve 30% hit ratio than most of other policies when running KMeans and PageRank, ARC can improve maximum 30% hit ratio than other policies when running complicated SQL applications, LRFU can always achieve relatively good performance when the configuration properties are set in reasonable range. We have implemented our prototype on Alluxio, which is a widely-used memory-centric distributed storage system. In addition, these eviction policies contributed by us have been merged into Alluxio and are already being in use.
机译:最近,利用固态硬盘等新型设备来提高混合存储的I / O性能已成为一种趋势。为了将新型设备应用于分布式环境中的混合存储,已经做了很多努力,但是大多数设备仅限于特定的文件系统,例如HDFS。此外,HDFS的低性能降低了混合存储的性能。在本文中,考虑到定期访问每个节点上的数据,我们使用可插拔驱逐框架来提高分布式环境中分层存储系统(一种混合存储系统)的性能。在驱逐框架之上,我们提供了两种驱逐策略,包括LRU,LRFU,LIRS和ARC,涵盖了不同的访问模式以加速上层大数据应用程序。此外,我们的设计适用于所有分层存储系统。然后,我们通过三个广泛使用的大数据应用程序评估驱逐框架的性能,发现运行KMeans和PageRank时,LIRS的命中率比大多数其他策略高30%,而当运行KMeans和PageRank时,ARC可以将最大命中率提高30%。在复杂的SQL应用程序上运行时,只要将配置属性设置在合理的范围内,LRFU就能始终获得相对较好的性能。我们已经在Alluxio上实现了原型,Alluxio是一个广泛使用的以内存为中心的分布式存储系统。此外,我们提供的这些搬迁政策已合并到Alluxio中,并且已经在使用中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号