...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >ALIGNMENT OF MINISATELLITE MAPS BASED ON RUN-LENGTH ENCODING SCHEME
【24h】

ALIGNMENT OF MINISATELLITE MAPS BASED ON RUN-LENGTH ENCODING SCHEME

机译:基于游程编码方案的微型卫星地图对准

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Subsequent duplication events are responsible for the evolution of the minisatellite maps. Alignment of two minisatellite maps should therefore take these duplication events into account, in addition to the well-known edit operations. All algorithms for computing an optimal alignment of two maps, including the one presented here, first deduce the costs of optimal duplication scenarios for all substrings of the given maps. Then, they incorporate the pre-computed costs in the alignment recurrence. However, all previous algorithms addressing this problem are dependent on the number of distinct map units (map alphabet) and do not fully make use of the repetitiveness of the map units. In this paper, we present an algorithm that remedies these shortcomings: our algorithm is alphabet-independent and is based on the run-length encoding scheme. It is the fastest in theory, and in practice as well, as shown by experimental results. Furthermore, our alignment model is more general than that of the previous algorithms, and captures better the duplication mechanism. Using our algorithm, we derive a quantitative evidence that there is a directional bias in the growth of minisatellites of the MSY1 dataset.
机译:随后的复制事件是微型卫星图演变的原因。因此,除了众所周知的编辑操作之外,两个小卫星图的对齐应该考虑这些复制事件。用于计算两个图的最佳对齐方式的所有算法(包括此处介绍的算法)首先得出针对给定图的所有子串的最佳复制方案的成本。然后,它们将预先计算的成本合并到比对重复中。但是,解决该问题的所有先前算法均取决于不同地图单元(地图字母)的数量,并且没有完全利用地图单元的重复性。在本文中,我们提出了一种弥补这些缺点的算法:我们的算法与字母无关,并且基于游程长度编码方案。实验结果表明,它在理论上和实践上都是最快的。此外,我们的对齐模型比以前的算法更通用,并且可以更好地捕获复制机制。使用我们的算法,我们得出了定量的证据,表明MSY1数据集的小卫星的生长存在方向偏差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号