首页> 外文会议>IEEE International Conference on Big Data Science and Engineering >MLC: An Efficient Multi-level Log Compression Method for Cloud Backup Systems
【24h】

MLC: An Efficient Multi-level Log Compression Method for Cloud Backup Systems

机译:MLC:用于云备份系统的有效的多级日志压缩方法

获取原文

摘要

With the rapid development of Internet and cloud services, logs become one of the fastest growing data types in backup storage systems. These massive data always require long-term storage, which incurs high storage overhead. Typical compression algorithms, such as traditional lossless compression and log-specific compression algorithms, are employed to increase storage efficiency. However, these algorithms ignore the inherent semantics of logs, particularly for the well-structured compositions within log records and the striking similarities among them. Therefore, they cannot guarantee a satisfactory compression ratio. To address this problem, we propose a novel Multi-level Log Compression (MLC) method for cloud backup systems. MLC can achieve high compression ratio for various applications and workloads. Different from existing compression algorithms, MLC first explores data redundancy among log records and divides them into different buckets in accordance with their similarities. Then, the log records are condensed by a variation of delta compression. After that, a traditional compression algorithm is employed as the secondary compression to further improve the compression ratio. To demonstrate the effectiveness of MLC, we conduct several experiments under different log workloads. The results show that, MLC improves the compression ratio of 7zip, gzip and bzip2 by up to 30.3%, 26.8% and 16.1%, respectively.
机译:随着互联网和云服务的快速发展,日志成为备份存储系统中最快的数据类型之一。这些大规模数据始终需要长期存储,引起高存储器开销。使用典型的压缩算法,例如传统的无损压缩和日志特定的压缩算法,以提高存储效率。然而,这些算法忽略了日志的固有语义,特别是对于日志记录中的结构良好的组合物以及它们之间的醒目相似之处。因此,他们无法保证令人满意的压缩比。为了解决这个问题,我们提出了一种用于云备份系统的新型多级日志压缩(MLC)方法。 MLC可以实现各种应用和工作负载的高压缩比。与现有的压缩算法不同,MLC首先探讨日志记录之间的数据冗余,并根据其相似度将它们划分为不同的桶。然后,日志记录通过Δ压缩的变化来凝聚。之后,使用传统的压缩算法作为次要压缩,以进一步提高压缩比。为了证明MLC的有效性,我们在不同的日志工作量下进行几个实验。结果表明,MLC将7 Zip,Gzip和Bzip2的压缩比率提高至30.3%,26.8%和16.1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号