【24h】

DNA coding using finite-context models and arithmetic coding

机译:使用有限上下文模型和算术编码进行DNA编码

获取原文

摘要

The interest in DNA coding has been growing with the availability of extensive genomic databases. Although only two bits are sufficient to encode the four DNA bases, efficient lossless compression methods are still needed due to the size of DNA sequences and because standard compression algorithms do not perform well on DNA sequences. As a result, several specific coding methods have been proposed. Most of these methods are based on searching procedures for finding exact or approximate repeats. Low order finite-context models have only been used as secondary, fall back mechanisms. In this paper, we show that finite-context models can also be used as main DNA encoding methods. We propose a coding method based on two finite-context models that compete for the encoding of data, on a block by block basis. The experimental results confirm the effectiveness of the proposed method.
机译:随着广泛的基因组数据库的可用性,对DNA编码的兴趣日益增长。尽管只有两位足以编码这四个DNA碱基,但是由于DNA序列的大小以及标准压缩算法在DNA序列上的表现不佳,仍然需要有效的无损压缩方法。结果,已经提出了几种特定的编码方法。这些方法大多数都是基于搜索程序来查找精确或近似重复。低阶有限上下文模型仅用作次要的回退机制。在本文中,我们证明了有限上下文模型也可以用作主要的DNA编码方法。我们提出了一种基于两个有限上下文模型的编码方法,这些模型在逐块的基础上竞争数据的编码。实验结果证实了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号