Collaborative Compaction Optimization System using Near-Data Processing for LSM-tree-based Key-Value Stores

Sun Hui; Liu Wei; Huang Jianzhong; Shi Weisong

首页> 外文期刊>Journal of Parallel and Distributed Computing >Collaborative Compaction Optimization System using Near-Data Processing for LSM-tree-based Key-Value Stores

【24h】

Collaborative Compaction Optimization System using Near-Data Processing for LSM-tree-based Key-Value Stores

机译：基于LSM树的键值存储的近数据协作压缩优化系统

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Log-structured merge tree (LSM-tree) based key-value stores are widely employed in large-scale storage systems. In compaction, high-level sorted string table files (i.e., SSTables) are merged with low-level overlapping key ranges and sorted for data queries. However, the compaction process incurs write amplification, which degrades system performance particularly under update-intensive workloads. Current optimizations mostly focus on reducing the overload of compaction in the host but rarely exploit parallelisms between the host and the device to fully utilize computing resources in the entire system for optimizing compaction performance. In this study, we propose Co-KV, a Collaborative Key-Value store, to improve compaction performance in LSM-tree-based key-value stores. Co-KV is based on a near-data processing (i.e., NDP) model-enabled storage device. Co-KV exhibits the following advantages: (1) it reduces write amplification and host-side CPU costs using a compaction offloading scheduling between a host computer and an NDP-enabled storage device; (2) it relieves the overload associated with data transfer between the host and the storage device; and (3) it improves the compaction of LSM-tree based key-value stores under update-intensive workloads.We employed an Open Ethernet Driver (OED), which is a real-world NDP platform as the testbed for our experiments. Extensive db_bench evaluations demonstrate that Co-KV achieves overall throughput improvements of approximately 1.75x, CPU cost reductions of approximately 68.1%, and write amplification reductions by up to 50.0% over the state-of-the-art LevelDB. Under YCSB workloads, Co-IN increases throughput by 1.7x similar to 1.9x while decreasing the write amplification and average latency by up to 50.0% and 46.3%, respectively. (C) 2019 Elsevier Inc. All rights reserved.

机译：基于日志结构的合并树（LSM-tree）的键值存储已在大规模存储系统中广泛采用。在压缩中，将高级排序的字符串表文件（即SSTables）与低级别的重叠键范围合并，并进行排序以进行数据查询。但是，压缩过程会导致写入放大，这会降低系统性能，尤其是在更新密集型工作负载下。当前的优化主要集中在减少主机中压缩的负担上，但是很少利用主机和设备之间的并行性来充分利用整个系统中的计算资源来优化压缩性能。在这项研究中，我们提出了Co-KV（协作键值存储），以提高基于LSM树的键值存储的压缩性能。 Co-KV基于启用了近数据处理（即NDP）模型的存储设备。 Co-KV具有以下优点：（1）通过在主机计算机和启用NDP的存储设备之间使用压缩卸载调度来减少写放大和主机端CPU成本；（2）减轻了主机与存储设备之间数据传输带来的过载；（3）改进了在更新密集型工作负载下基于LSM树的键值存储的压缩。我们使用了开放式以太网驱动程序（OED），它是真实的NDP平台作为我们实验的测试平台。广泛的db_bench评估表明，与最先进的LevelDB相比，Co-KV的整体吞吐量提高了约1.75倍，CPU成本降低了约68.1％，写放大降低了多达50.0％。在YCSB工作负载下，Co-IN将吞吐量提高了1.9倍，是1.9倍，同时将写放大和平均延迟分别降低了50.0％和46.3％。（C）2019 Elsevier Inc.保留所有权利。

著录项

来源
《Journal of Parallel and Distributed Computing》 |2019年第9期|29-43|共15页
作者
Sun Hui; Liu Wei; Huang Jianzhong; Shi Weisong;
展开▼
作者单位

Anhui Univ, Sch Comp Sci & Technol, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Anhui, Peoples R China;

Anhui Univ, Sch Comp Sci & Technol, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Anhui, Peoples R China;

Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Hubei, Peoples R China;

Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
LSM-tree; Key-value store; Near-data processing; Compaction; Partitioning; Offloading;

机译：LSM树;键值存储;近数据处理;压实;分区;卸载;

相似文献

外文文献
中文文献
专利

1. Collaborative Compaction Optimization System using Near-Data Processing for LSM-tree-based Key-Value Stores [J] . Sun Hui, Liu Wei, Huang Jianzhong, Journal of Parallel and Distributed Computing . 2019,第Sepa期

机译：基于LSM-Tree的键值存储使用近数据处理的协同压缩优化系统
2. Texas Eastern Defends New Gas Interchangeability and Quality Specifications, the Collaborative Process to Develop Them, and the Belief That Major Changes Are in Store for its System Due to Natural Gas Resource Shifts Driven by Marcellus and Haynesville Shale Supply and LNG [J] . Foster Natural Gas Report Group Foster Natural Gas Report . 2009,第2768期

机译：德州东部捍卫新的天然气互换性和质量规范，开发它们的协作过程以及相信由于Marcellus和Haynesville页岩供应和液化天然气驱动的天然气资源转移，其系统即将发生重大变化的信念
3. An improved hierarchical genetic algorithm for collaborative optimization of manufacturing processes in metal structure manufacturing systems: [J] . Dezhong Qi, Sanqiang Zhang, Mingyong Liu, Advances in Mechanical Engineering . 2017,第3期

机译：用于金属结构制造系统中制造过程的协作优化的改进的分层遗传算法：
4. LDC: A Lower-Level Driven Compaction Method to Optimize SSD-Oriented Key-Value Stores [C] . Yunpeng Chai, Yanfeng Chai, Xin Wang, IEEE International Conference on Data Engineering . 2019

机译：LDC：一种用于优化面向SSD的键值存储的下层驱动压缩方法
5. Building a scalable and high-performance key-value store system. [D] . Xu, Yuehai. 2014

机译：构建可扩展的高性能键值存储系统。
6. Real and synthetic data sets for benchmarking key-value stores focusing on various data types and sizes [O] . Hyuk-Yoon Kwon 2020

机译：真实和综合数据集用于基准测试关键值存储重点是各种数据类型和大小
7. Optimization of Power Inputs of Pneumo-Transport and Storing of Fine-Grained Building Materials on the Base of Adaptive Micro-Processed System [O] . Tatjana Souetina, Thomas Bock, Alexej Bulgakow 2001

机译：适应性微处理系统底座电力运输电力输入功率输入的优化

Collaborative Compaction Optimization System using Near-Data Processing for LSM-tree-based Key-Value Stores

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅