首页> 外文期刊>Bioinformatics >Toward a statistically explicit understanding of de novo sequence assembly
【24h】

Toward a statistically explicit understanding of de novo sequence assembly

机译:对从头序列组装进行统计学上的明确理解

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Draft de novo genome assemblies are now available for many organisms. These assemblies are point estimates of the true genome sequences. Each is a specific hypothesis, drawn from among many alternative hypotheses, of the sequence of a genome. Assembly uncertainty, the inability to distinguish between multiple alternative assembly hypotheses, can be due to real variation between copies of the genome in the sample, errors and ambiguities in the sequenced data and assumptions and heuristics of the assemblers. Most assemblers select a single assembly according to ad hoc criteria, and do not yet report and quantify the uncertainty of their outputs. Those assemblers that do report uncertainty take different approaches to describing multiple assembly hypotheses and the support for each. Results: Here we review and examine the problem of representing and measuring uncertainty in assemblies. A promising recent development is the implementation of assemblers that are built according to explicit statistical models. Some new assembly methods, for example, estimate and maximize assembly likelihood. These advances, combined with technical advances in the representation of alternative assembly hypotheses, will lead to a more complete and biologically relevant understanding of assembly uncertainty. This will in turn facilitate the interpretation of downstream analyses and tests of specific biological hypotheses.
机译:动机:许多生物现在都可以使用从头开始的基因组组装。这些集合是真实基因组序列的点估计。每个都是从基因组序列的许多其他假设中得出的特定假设。装配不确定性(无法区分多个替代装配假设)可能是由于样品中基因组拷贝之间的真实差异,测序数据的错误和歧义以及装配者的假设和启发。大多数组装者根据临时标准选择单个组装,但尚未报告和量化其输出的不确定性。那些确实报告不确定性的组装商采用不同的方法来描述多个组装假设以及对每个假设的支持。结果:在这里,我们审查并检查了表示和测量装配中不确定性的问题。最近有希望的发展是根据显式统计模型构建的汇编程序的实现。例如,一些新的组装方法会估计并最大化组装可能性。这些进步与替代装配假说表示的技术进步相结合,将导致对装配不确定性的更完整和生物学相关的理解。反过来,这将有助于对特定生物学假设的下游分析和检验的解释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号