MULTI-THREAD FAST STORAGE LOSSLESS COMPRESSION METHOD AND SYSTEM FOR FASTQ DATA
Abstract
Provided is a multi-threaded, fast-storage lossless compression method for FASTQ data, applied to the compression of DNA sequences. The method comprises: a data classification step (S11) of inputting original FASTQ data and dividing each short read of the original FASTQ data into three data streams, namely metadata, quality scores, and the base sequence; a data compression step (S12) of: for the metadata, using delta (incremental) encoding to detect and eliminate redundant information; for the quality scores, compressing with a bit-level PPM (prediction by partial matching) model and arithmetic coding; and for the base sequence, compressing with an improved fixed-order arithmetic coding; and a data output step (S13) of archiving and merging the compression results of the different data streams and outputting the final compressed data. The solution can improve the compression efficiency and compression speed.
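As a rough illustration of steps S11 and the metadata part of S12, the following Python sketch splits FASTQ records into three streams and delta-encodes the headers against their predecessors. It is not the patented implementation: the function names (split_streams, delta_encode, delta_decode), the shared-prefix form of delta encoding, and the assumption of strict four-line records are all illustrative choices, and the PPM and arithmetic-coding stages for quality scores and bases are omitted entirely.

```python
from typing import List, Tuple


def split_streams(fastq_text: str) -> Tuple[List[str], List[str], List[str]]:
    """Split FASTQ text into metadata, base-sequence, and quality-score streams.

    Assumes strict 4-line records (header, bases, '+', qualities).
    """
    lines = fastq_text.strip().splitlines()
    if len(lines) % 4 != 0:
        raise ValueError("expected 4 lines per FASTQ record")
    metadata, bases, qualities = [], [], []
    for i in range(0, len(lines), 4):
        header, seq, plus, qual = lines[i:i + 4]
        if not header.startswith("@") or not plus.startswith("+"):
            raise ValueError(f"malformed record starting at line {i + 1}")
        metadata.append(header[1:])  # drop the leading '@'
        bases.append(seq)
        qualities.append(qual)
    return metadata, bases, qualities


def delta_encode(headers: List[str]) -> List[Tuple[int, str]]:
    """Encode each header as (shared-prefix length with previous header, suffix).

    Hypothetical, simplified stand-in for the incremental encoding of metadata.
    """
    encoded = []
    prev = ""
    for h in headers:
        n = 0
        limit = min(len(prev), len(h))
        while n < limit and prev[n] == h[n]:
            n += 1
        encoded.append((n, h[n:]))
        prev = h
    return encoded


def delta_decode(encoded: List[Tuple[int, str]]) -> List[str]:
    """Invert delta_encode, rebuilding every header exactly (lossless)."""
    headers = []
    prev = ""
    for n, suffix in encoded:
        h = prev[:n] + suffix
        headers.append(h)
        prev = h
    return headers


# Made-up two-record sample, for illustration only.
SAMPLE = "@read.1 len=4\nACGT\n+\nIIII\n@read.2 len=4\nTTGA\n+\nFFFF\n"

meta, seqs, quals = split_streams(SAMPLE)
encoded_meta = delta_encode(meta)
assert delta_decode(encoded_meta) == meta  # round trip is lossless
```

In this sketch the second header shares the prefix "read." with the first, so only the differing suffix is stored. A real compressor would then pass each stream to its own entropy coder before merging the outputs as in S13.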