首页> 外文期刊>Genetic programming and evolvable machines >Evolution of human-competitive lossless compression algorithms with GP-zip2
【24h】

Evolution of human-competitive lossless compression algorithms with GP-zip2

机译:GP-zip2促进人类竞争的无损压缩算法的发展

获取原文
获取原文并翻译 | 示例
       

摘要

We propose GP-zip2, a new approach to lossless data compression based on Genetic Programming (GP). GP is used to optimally combine well-known lossless compression algorithms to maximise data compression. GP-zip2 evolves programs with multiple components. One component analyses statistical features extracted by sequentially scanning the data to be compressed and divides the data into blocks. These blocks are projected onto a two-dimensional Euclidean space via two further (evolved) program components. K-means clustering is then applied to group similar data blocks. Each cluster is labelled with the optimal compression algorithm for its member blocks. After evolution, evolved programs can be used to compress unseen data. The compression algorithms available to GP-zip2 are: Arithmetic coding, Lempel-Ziv-Welch, Unbounded Prediction by Partial Matching, Run Length Encoding, and Bzip2. Experimentation shows that the results produced by GP-zip2 are human-competitive, being typically superior to well-established human-designed compression algorithms in terms of the compression ratios achieved in heterogeneous archive files.
机译:我们提出GP-zip2,这是一种基于遗传编程(GP)的无损数据压缩新方法。 GP用于最佳地组合众所周知的无损压缩算法以最大化数据压缩。 GP-zip2开发具有多个组件的程序。一个组件分析通过依次扫描要压缩的数据而提取的统计特征,并将数据划分为多个块。这些块通过另外两个(演化的)程序组件投影到二维欧几里得空间上。然后,将K均值聚类应用于相似数据块的分组。每个群集都为其成员块标记了最佳压缩算法。演化后,可以使用演化后的程序来压缩看不见的数据。 GP-zip2可用的压缩算法是:算术编码,Lempel-Ziv-Welch,部分匹配的无界预测,游程长度编码和Bzip2。实验表明,GP-zip2产生的结果具有人类竞争力,就异构档案文件中实现的压缩率而言,它通常优于公认的人为设计的压缩算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号