首页> 外文会议>2011 IEEE 27th Symposium on Mass Storage Systems and Technologies >Hot data identification for flash-based storage systems using multiple bloom filters
【24h】

Hot data identification for flash-based storage systems using multiple bloom filters

机译:使用多个布隆过滤器的基于闪存的存储系统的热数据识别

获取原文

摘要

Hot data identification can be applied to a variety of fields. Particularly in flash memory, it has a critical impact on its performance (due to a garbage collection) as well as its life span (due to a wear leveling). Although the hot data identification is an issue of paramount importance in flash memory, little investigation has been made. Moreover, all existing schemes focus almost exclusively on a frequency viewpoint. However, recency also must be considered equally with the frequency for effective hot data identification. In this paper, we propose a novel hot data identification scheme adopting multiple bloom filters to efficiently capture finer-grained recency as well as frequency. In addition to this scheme, we propose a Window-based Direct Address Counting (WDAC) algorithm to approximate an ideal hot data identification as our baseline. Unlike the existing baseline algorithm that cannot appropriately capture recency information due to its exponential batch decay, our WDAC algorithm, using a sliding window concept, can capture very fine-grained recency information. Our experimental evaluation with diverse realistic workloads including real SSD traces demonstrates that our multiple bloom filter-based scheme outperforms the state-of-the-art scheme. In particular, ours not only consumes 50% less memory and requires less computational overhead up to 58%, but also improves its performance up to 65%.
机译:热数据识别可以应用于各种领域。尤其是在闪存中,它对其性能(由于收集垃圾)及其寿命(由于损耗平衡)具有关键影响。尽管热数据识别是闪存中最重要的问题,但很少进行研究。此外,所有现有方案几乎都集中在频率观点上。但是,新近度也必须与有效识别热数据的频率同等考虑。在本文中,我们提出了一种新颖的热数据识别方案,该方案采用多个布隆过滤器来有效捕获更细粒度的新近度和频率。除此方案外,我们还提出了一种基于窗口的直接地址计数(WDAC)算法,以近似理想的热数据识别为基准。与现有的基线算法由于其指数批次衰减而无法适当地捕获新近度信息不同,我们的WDAC算法使用滑动窗口概念,可以捕获非常细粒度的新近度信息。我们对包括真实SSD迹线在内的各种实际工作负载进行的实验评估表明,基于多重布隆过滤器的方案优于最新方案。特别是,我们不仅消耗了50%的内存,并减少了58%的计算开销,而且将其性能提高了65%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号