Conference: International Conference on Practical Applications of Computational Biology & Bioinformatics

Substitutional Tolerant Markov Models for Relative Compression of DNA Sequences

Abstract

Referential compression is one of the fundamental operations for storing and analyzing DNA data. Models that incorporate relative compression, a special case of referential compression, are being steadily improved, particularly those based on Markov models. In this paper, we propose a new model, the substitutional tolerant Markov model (STMM), which can be used in cooperation with regular Markov models to improve compression efficiency. We assessed its impact on synthetic and real DNA sequences and found a substantial improvement in compression at the cost of only a slight increase in computation time. In particular, it is highly efficient at modeling species that diverged less than 40 million years ago.
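
The abstract does not spell out how an STMM tolerates substitutions. The Python sketch below illustrates one possible reading, which is an assumption and not the authors' implementation: a regular order-k Markov model conditions on the last k symbols actually seen in the target, while the tolerant variant conditions on the last k symbols it predicted as most probable, so a single substitution does not disturb the following k contexts. Counts are learned from the reference only and kept frozen, which stands in for relative compression. All function names, the smoothing parameter `alpha`, the equal-weight mixture used for "cooperation", and the toy data are hypothetical, and the sketch has no safeguard for the case where the tolerant model's predictions drift away from the target.

```python
import math
import random
from collections import defaultdict

ALPHABET = "ACGT"  # assumes sequences contain only these four symbols


def train_counts(reference, k):
    """Count which symbol follows each length-k context in the reference."""
    counts = defaultdict(lambda: dict.fromkeys(ALPHABET, 0))
    for i in range(k, len(reference)):
        counts[reference[i - k:i]][reference[i]] += 1
    return counts


def symbol_prob(counts, context, symbol, alpha=1 / 16):
    """Additively smoothed probability of `symbol` after `context`."""
    row = counts.get(context) or dict.fromkeys(ALPHABET, 0)
    total = sum(row.values()) + alpha * len(ALPHABET)
    return (row[symbol] + alpha) / total


def predicted_symbol(counts, context, fallback):
    """Most frequent successor of `context`, or `fallback` if unseen."""
    row = counts.get(context)
    if not row or max(row.values()) == 0:
        return fallback
    return max(ALPHABET, key=row.__getitem__)


def relative_bits(target, counts, k):
    """Estimated bits for the (regular, tolerant, equal-weight mix) models.

    Counts stay frozen, so only the reference informs the predictions.
    """
    ctx_reg = ctx_tol = target[:k]
    totals = [2.0 * k] * 3  # first k symbols charged 2 bits each
    for sym in target[k:]:
        p_reg = symbol_prob(counts, ctx_reg, sym)
        p_tol = symbol_prob(counts, ctx_tol, sym)
        for j, p in enumerate((p_reg, p_tol, 0.5 * (p_reg + p_tol))):
            totals[j] -= math.log2(p)
        # Regular model: context slides over the symbols actually seen.
        ctx_reg = ctx_reg[1:] + sym
        # Tolerant model: context slides over the symbols it predicted.
        ctx_tol = ctx_tol[1:] + predicted_symbol(counts, ctx_tol, sym)
    return tuple(totals)


def substitute(seq, step):
    """Replace every `step`-th symbol with a different base (toy mutation)."""
    out = list(seq)
    for i in range(step, len(out), step):
        out[i] = ALPHABET[(ALPHABET.index(out[i]) + 1) % 4]
    return "".join(out)


# Toy data: a random reference and a target that differs by substitutions only.
rng = random.Random(0)
reference = "".join(rng.choice(ALPHABET) for _ in range(2000))
target = substitute(reference, 100)
regular, tolerant, mixed = relative_bits(target, train_counts(reference, 16), 16)
print(f"regular={regular:.0f} tolerant={tolerant:.0f} mixed={mixed:.0f} bits")
```

On this toy input, the regular model falls back to near-uniform probabilities for roughly k symbols after each substitution, whereas the tolerant context keeps following the reference. The equal-weight mixture is only the simplest way to let the two models cooperate; a weighting that adapts to recent performance would be a natural alternative.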
