
MULTI-THREAD FAST STORAGE LOSSLESS COMPRESSION METHOD AND SYSTEM FOR FASTQ DATA


Abstract

Provided is a multi-threaded, fast-storage lossless compression method for FASTQ data, applied to the compression of DNA sequences. The method comprises: a data classification step (S11) of inputting the original FASTQ data and dividing each short read into three data streams, namely metadata, quality scores, and the base sequence; a data compression step (S12) of using incremental (delta) encoding to detect and eliminate redundant information in the metadata, compressing the quality scores with a bit-level PPM prediction model and arithmetic coding, and compressing the base sequence with improved fixed-order arithmetic coding; and a data output step (S13) of archiving and merging the compression results of the different data streams and outputting the final compressed data. The solution can improve both compression efficiency and compression speed.
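The three steps can be illustrated with a minimal Python sketch. It is not the patented implementation: all function names are hypothetical, `zlib` stands in for both the bit-level PPM model and the fixed-order arithmetic coder, the three-worker thread pool is only one possible reading of "multi-thread", and the length-prefixed container is an assumed archive layout. The sketch also assumes four-line records whose separator line is a bare `+`.

```python
from concurrent.futures import ThreadPoolExecutor
import zlib


def split_fastq(text):
    """S11: split FASTQ text into metadata, base and quality-score streams."""
    lines = text.strip().split("\n")
    if len(lines) % 4 != 0:
        raise ValueError("expected 4 lines per FASTQ record")
    headers, bases, quals = [], [], []
    for i in range(0, len(lines), 4):
        headers.append(lines[i])      # '@' header line (metadata)
        bases.append(lines[i + 1])    # base sequence
        quals.append(lines[i + 3])    # quality scores (line i+2 is '+')
    return headers, bases, quals


def delta_encode(headers):
    """Incremental encoding: (shared-prefix length with previous header, new suffix)."""
    out, prev = [], ""
    for h in headers:
        n = 0
        while n < min(len(h), len(prev)) and h[n] == prev[n]:
            n += 1
        out.append((n, h[n:]))
        prev = h
    return out


def delta_decode(encoded):
    """Inverse of delta_encode."""
    out, prev = [], ""
    for n, suffix in encoded:
        prev = prev[:n] + suffix
        out.append(prev)
    return out


def compress_fastq(text):
    """S12 + S13: compress each stream in its own thread, then merge the results."""
    headers, bases, quals = split_fastq(text)
    meta = "\n".join(f"{n}\t{s}" for n, s in delta_encode(headers))
    streams = [meta.encode(), "\n".join(bases).encode(), "\n".join(quals).encode()]
    with ThreadPoolExecutor(max_workers=3) as pool:
        # zlib is a stand-in for the PPM / arithmetic coders named in the abstract.
        packed = list(pool.map(zlib.compress, streams))
    # Assumed container: each stream is prefixed with its 4-byte big-endian length.
    return b"".join(len(p).to_bytes(4, "big") + p for p in packed)


def decompress_fastq(blob):
    """Reverse compress_fastq and rebuild the original FASTQ text."""
    streams, pos = [], 0
    while pos < len(blob):
        size = int.from_bytes(blob[pos:pos + 4], "big")
        streams.append(zlib.decompress(blob[pos + 4:pos + 4 + size]).decode())
        pos += 4 + size
    meta, bases, quals = streams
    pairs = (line.split("\t", 1) for line in meta.split("\n"))
    headers = delta_decode([(int(n), s) for n, s in pairs])
    records = zip(headers, bases.split("\n"), quals.split("\n"))
    return "".join(f"{h}\n{b}\n+\n{q}\n" for h, b, q in records)
```

Separating the streams lets each one be handled by a coder suited to its statistics, and because the streams are independent after the split they can be compressed concurrently. Round-tripping a file through `compress_fastq` and `decompress_fastq` is a simple way to confirm that the sketch stays lossless under the stated assumptions.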

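Bibliographic data

  • Publication number: WO2017214765A1
  • Publication date: 2017-12-21
  • Original format: PDF
  • Applicant/patentee: SHENZHEN UNIVERSITY
  • Application number: WO2016CN85426
  • Filing date: 2016-06-12
  • Classification: H03M7/30; G06F19/10
  • Country: WO
  • Database entry time: 2022-08-21 12:46:49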
