【24h】

Split File Model for Big Data in Low Throughput Storage

机译:低吞吐量存储中大数据的拆分文件模型

获取原文
获取原文并翻译 | 示例

摘要

The demand for low-cost, large-scale storage is increasing. Recently, several low-throughput storage services such as the Pogo plug Cloud have been developed. These services are based on Amazon Glacier. They have low throughput, but low cost and large capacity. Therefore, these services are suitable for backups or archiving big data and can be used instead of offline storage tiers. To utilize such low throughput storage efficiently, we need tools for effective deduplication and resumable transfers, amongst others. We propose a split file model that can represent big data efficiently in low throughput storage. In the split file model, a large file is divided into many small parts, which are stored in a directory. We have developed tool commands to support the use of split files in a transparent way. Using these commands, replicated data is naturally excluded and effective shallow copying is supported. In this paper, we describe the split file model in detail and evaluate an implementation thereof.
机译:对低成本,大规模存储的需求正在增长。最近,已经开发了几种低吞吐量的存储服务,例如Pogo插件云。这些服务基于Amazon Glacier。它们具有低吞吐量,但成本低且容量大。因此,这些服务适用于备份或归档大数据,并且可以代替脱机存储层使用。为了有效利用如此低的吞吐量存储,我们需要工具来进行有效的重复数据删除和可恢复的传输等。我们提出了一个拆分文件模型,该模型可以在低吞吐量存储中高效地表示大数据。在拆分文件模型中,大文件分为许多小部分,这些小部分存储在目录中。我们已经开发了工具命令,以透明方式支持分割文件的使用。使用这些命令,自然就排除了复制的数据,并支持有效的浅表复制。在本文中,我们详细描述了拆分文件模型并评估了其实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号