首页> 外文会议>2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops >Algorithm for DNA sequence compression based on prediction of mismatch bases and repeat location
【24h】

Algorithm for DNA sequence compression based on prediction of mismatch bases and repeat location

机译:基于错配碱基预测和重复定位的DNA序列压缩算法

获取原文
获取外文期刊封面目录资料

摘要

For DNA sequence Compression, it has been observed that methods based on Markov modeling and repeats give best results. However, these methods tend to use uniform distribution assumption of mismatches for approximate repeats. We show that these replacements are not uniformly distributed and we can improve compression efficiency by using non uniform distribution for mismatches. We also propose a hash table based method to predict repeat location which works well for block based genomic sequence compression algorithms. The proposed methods give good compression gains. The method can be incorporated into any algorithm that uses approximate repeats to realize similar gains.
机译:对于DNA序列压缩,已观察到基于Markov建模和重复的方法可获得最佳结果。但是,这些方法倾向于将不匹配的均匀分布假设用于近似重复。我们显示这些替换不是均匀分布的,并且可以通过使用不均匀分布的不匹配项来提高压缩效率。我们还提出了一种基于哈希表的方法来预测重复位置,该方法非常适合基于块的基因组序列压缩算法。所提出的方法具有良好的压缩增益。该方法可以合并到使用近似重复来实现相似增益的任何算法中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号