首页> 美国卫生研究院文献>Molecular Biology and Evolution >Improved Algorithmic Complexity for the 3SEQ Recombination Detection Algorithm
【2h】

Improved Algorithmic Complexity for the 3SEQ Recombination Detection Algorithm

机译:3SEQ重组检测算法的改进算法复杂度

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Identifying recombinant sequences in an era of large genomic databases is challenging as it requires an efficient algorithm to identify candidate recombinants and parents, as well as appropriate statistical methods to correct for the large number of comparisons performed. In 2007, a computation was introduced for an exact nonparametric mosaicism statistic that gave high-precision P values for putative recombinants. This exact computation meant that multiple-comparisons corrected P values also had high precision, which is crucial when performing millions or billions of tests in large databases. Here, we introduce an improvement to the algorithmic complexity of this computation from O(mn3) to O(mn2), where m and n are the numbers of recombination-informative sites in the candidate recombinant. This new computation allows for recombination analysis to be performed in alignments with thousands of polymorphic sites. Benchmark runs are presented on viral genome sequence alignments, new features are introduced, and applications outside recombination analysis are discussed.
机译:在大型基因组数据库时代,鉴定重组序列具有挑战性,因为它需要一种有效的算法来鉴定候选重组体和亲本,并需要适当的统计方法来校正大量的比较。 2007年,针对精确的非参数镶嵌统计进行了计算,从而为推定的重组子提供了高精度的P值。这种精确的计算意味着经过多次比较的校正后的P值也具有很高的精度,这在大型数据库中执行数百万或数十亿次测试时至关重要。在此,我们将这种计算的算法复杂度从O(mn 3 )改进为O(mn 2 ),其中m和n是重组数-候选重组子中的信息性位点。这种新的计算方法使得重组分析可以与成千上万的多态性位点进行比对。提出了关于病毒基因组序列比对的基准试验,引入了新功能,并讨论了重组分析以外的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号