首页> 外文期刊>Journal of Parallel and Distributed Computing >Collaborative Compaction Optimization System using Near-Data Processing for LSM-tree-based Key-Value Stores
【24h】

Collaborative Compaction Optimization System using Near-Data Processing for LSM-tree-based Key-Value Stores

机译:基于LSM树的键值存储的近数据协作压缩优化系统

获取原文
获取原文并翻译 | 示例

摘要

Log-structured merge tree (LSM-tree) based key-value stores are widely employed in large-scale storage systems. In compaction, high-level sorted string table files (i.e., SSTables) are merged with low-level overlapping key ranges and sorted for data queries. However, the compaction process incurs write amplification, which degrades system performance particularly under update-intensive workloads. Current optimizations mostly focus on reducing the overload of compaction in the host but rarely exploit parallelisms between the host and the device to fully utilize computing resources in the entire system for optimizing compaction performance. In this study, we propose Co-KV, a Collaborative Key-Value store, to improve compaction performance in LSM-tree-based key-value stores. Co-KV is based on a near-data processing (i.e., NDP) model-enabled storage device. Co-KV exhibits the following advantages: (1) it reduces write amplification and host-side CPU costs using a compaction offloading scheduling between a host computer and an NDP-enabled storage device; (2) it relieves the overload associated with data transfer between the host and the storage device; and (3) it improves the compaction of LSM-tree based key-value stores under update-intensive workloads.We employed an Open Ethernet Driver (OED), which is a real-world NDP platform as the testbed for our experiments. Extensive db_bench evaluations demonstrate that Co-KV achieves overall throughput improvements of approximately 1.75x, CPU cost reductions of approximately 68.1%, and write amplification reductions by up to 50.0% over the state-of-the-art LevelDB. Under YCSB workloads, Co-IN increases throughput by 1.7x similar to 1.9x while decreasing the write amplification and average latency by up to 50.0% and 46.3%, respectively. (C) 2019 Elsevier Inc. All rights reserved.
机译:基于日志结构的合并树(LSM-tree)的键值存储已在大规模存储系统中广泛采用。在压缩中,将高级排序的字符串表文件(即SSTables)与低级别的重叠键范围合并,并进行排序以进行数据查询。但是,压缩过程会导致写入放大,这会降低系统性能,尤其是在更新密集型工作负载下。当前的优化主要集中在减少主机中压缩的负担上,但是很少利用主机和设备之间的并行性来充分利用整个系统中的计算资源来优化压缩性能。在这项研究中,我们提出了Co-KV(协作键值存储),以提高基于LSM树的键值存储的压缩性能。 Co-KV基于启用了近数据处理(即NDP)模型的存储设备。 Co-KV具有以下优点:(1)通过在主机计算机和启用NDP的存储设备之间使用压缩卸载调度来减少写放大和主机端CPU成本; (2)减轻了主机与存储设备之间数据传输带来的过载; (3)改进了在更新密集型工作负载下基于LSM树的键值存储的压缩。我们使用了开放式以太网驱动程序(OED),它是真实的NDP平台作为我们实验的测试平台。广泛的db_bench评估表明,与最先进的LevelDB相比,Co-KV的整体吞吐量提高了约1.75倍,CPU成本降低了约68.1%,写放大降低了多达50.0%。在YCSB工作负载下,Co-IN将吞吐量提高了1.9倍,是1.9倍,同时将写放大和平均延迟分别降低了50.0%和46.3%。 (C)2019 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号