首页> 外文期刊>Future generation computer systems >ActiveSort: Efficient external sorting using active SSDs in the MapReduce framework
【24h】

ActiveSort: Efficient external sorting using active SSDs in the MapReduce framework

机译:ActiveSort:使用MapReduce框架中的活动SSD进行有效的外部排序

获取原文
获取原文并翻译 | 示例
       

摘要

In the last decades, there has been an explosion in the volume of data to be processed by data-intensive computing applications. As a result, processing I/O operations efficiently has become an important challenge. SSDs (solid state drives) are an effective solution that not only improves the I/O throughput but also reduces the amount of I/O transfer by adopting the concept of active SSDs. Active SSDs offload a part of the data-processing tasks usually performed in the host to the SSD. Offloading data-processing tasks removes extra data transfer and improves the overall data processing performance. In this work, we propose ActiveSort, a novel mechanism to improve the external sorting algorithm using the concept of active SSDs. External sorting is used extensively in the data-intensive computing frameworks such as Hadoop. By performing merge operations on-the-fly within the SSD, ActiveSort reduces the amount of I/O transfer and improves the performance of external sorting in Hadoop. Our evaluation results on a real SSD platform indicate that the Hadoop applications using ActiveSort outperform the original Hadoop by up to 36.1%. ActiveSort reduces the amount of write by up to 40.4%, thereby improving the lifetime of the SSD.
机译:在过去的几十年中,数据密集型计算应用程序处理的数据量激增。结果,有效地处理I / O操作已成为一项重要的挑战。 SSD(固态驱动器)是一种有效的解决方案,它通过采用活动SSD的概念,不仅可以提高I / O吞吐量,而且可以减少I / O传输量。活动的SSD将通常在主机中执行的部分数据处理任务卸载到SSD。卸载数据处理任务可以消除额外的数据传输,并提高整体数据处理性能。在这项工作中,我们提出了ActiveSort,这是一种使用主动SSD的概念来改进外部排序算法的新颖机制。外部排序在Hadoop等数据密集型计算框架中广泛使用。通过在SSD中即时执行合并操作,ActiveSort减少了I / O传输量并提高了Hadoop中外部排序的性能。我们在真实的SSD平台上的评估结果表明,使用ActiveSort的Hadoop应用程序比原始Hadoop的性能高出36.1%。 ActiveSort最多可减少40.4%的写入量,从而延长了SSD的使用寿命。

著录项

  • 来源
    《Future generation computer systems》 |2016年第12期|76-89|共14页
  • 作者单位

    School of Computing, Korea Advanced Institute of Science and Technology, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea;

    College of Information and Communication Engineering Sungkyunkwan University, 2066 Seobu-Ro, Jangan-gu, Suwon 16419, Republic of Korea;

    School of Computing, Korea Advanced Institute of Science and Technology, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea;

    College of Information and Communication Engineering Sungkyunkwan University, 2066 Seobu-Ro, Jangan-gu, Suwon 16419, Republic of Korea;

    School of Computing, Korea Advanced Institute of Science and Technology, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Data-intensive computing; MapReduce; External sorting; Solid state drives; In-storage processing;

    机译:数据密集型计算;MapReduce;外部分类;固态驱动器;入库处理;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号