...
首页> 外文期刊>Discrete Applied Mathematics >Fast multiple alignment of ungapped DNA sequences using information theory and a relaxation method
【24h】

Fast multiple alignment of ungapped DNA sequences using information theory and a relaxation method

机译:利用信息论和松弛法快速进行无缺口DNA序列的多重比对

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

An information theory based multiple alignment ("Malign") method was used to align the DNA binding sequences of the OxyR and Fis proteins, whose sequence conservation is so spread out that it is difficult to identify the sites. In the algorithm described here, the information content of the sequences is used as a unique global criterion for the quality of the alignment. The algorithm uses look-up tables to avoid recalculating computationally expensive functions such as the logarithm. Because there are no arbitrary constants and because the results are reported in absolute units (bits), the best alignment can be chosen without ambiguity. Starting from randomly selected alignments, a hill-climbing algorithm can track through the immense space of sn combinations where s is the number of sequences and n is the number of positions possible for each sequence. Instead of producing a single alignment, the algorithm is fast enough that one can afford to use many start points and to classify the solutions. Good convergence is indicated by the presence of a single well-populated solution class having higher information content than other classes. The existence of several distinct classes for the Fis protein indicates that those binding sites have self-similar features.
机译:基于信息论的多重比对(“ Malign”)方法用于比对OxyR和Fis蛋白的DNA结合序列,其序列保守性如此分散以至于难以鉴定位点。在此处描述的算法中,序列的信息内容用作比对质量的唯一全局标准。该算法使用查找表来避免重新计算计算量大的函数(例如对数)。由于没有任意常数,并且由于结果以绝对单位(位)报告,因此可以选择最佳对齐方式而不会产生歧义。从随机选择的比对开始,爬山算法可以跟踪sn组合的巨大空间,其中s是序列数,n是每个序列可能的位置数。该算法没有产生单一的对齐方式,而是足够快,以至于人们可以负担得起使用多个起点并对解决方案进行分类。良好的收敛性是由于存在一个单独的人口众多的解决方案类,该类比其他类具有更高的信息含量。 Fis蛋白有几种不同的类别,表明这些结合位点具有自相似的特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号