首页> 外文期刊>Bioinformatics >Gap statistics for whole genome shotgun DNA sequencing projects
【24h】

Gap statistics for whole genome shotgun DNA sequencing projects

机译:全基因组shot弹枪DNA测序项目的缺口统计

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Investigators utilize gap estimates for DNA sequencing projects. Standard theories assume sequences are independently and identically distributed, leading to appreciable under-prediction of gaps. Results: Using a statistical scaling factor and data from 20 representative whole genome shotgun projects, we construct regression equations that relate coverage to a normalized gap measure. Prokaryotic genomes do not correlate to sequence coverage, while eukaryotes show strong correlation if the chaff is ignored. Gaps decrease at an exponential rate of only about one-third of that predicted via theory alone. Case studies suggest that departure from theory can largely be attributed to assembly difficulties for repeat-rich genomes, but bias and coverage anomalies are also important when repeats are sparse. Such factors cannot be readily characterized a priori, suggesting upper limits on the accuracy of gap prediction. We also find that diminishing coverage probability discussed in other studies is a theoretical artifact that does not arise for the typical project.
机译:动机:研究人员利用缺口估计进行DNA测序项目。标准理论假设序列是独立且均匀分布的,从而导致对缺口的明显低估。结果:使用统计比例因子和来自20个代表性全基因组shot弹枪项目的数据,我们构建了将覆盖率与归一化缺口量度相关的回归方程。原核生物基因组与序列覆盖率不相关,而如果忽略谷壳,则真核生物则显示出很强的相关性。差距仅以理论上预测的三分之一的指数速度下降。案例研究表明,与理论的背离在很大程度上可以归因于富含重复序列的基因组的组装困难,但是当重复序列稀疏时,偏差和覆盖范围异常也很重要。这样的因素不能容易地被先验地表征,这提示了间隙预测的准确性的上限。我们还发现,其他研究中讨论的降低覆盖率是一种理论上的伪像,对于典型的项目而言是不会出现的。

著录项

  • 来源
    《Bioinformatics》 |2004年第10期|p. 1527-1534|共8页
  • 作者单位

    Genome Sequencing Center, Washington University School of Medicine, Box 8501, 4444 Forest Park Blvd., Saint Louis, MO 63108 USA;

    Genome Sequencing Center, Washington University School of Medicine, Box 8501, 4444 Forest Park Blvd., Saint Louis, MO 63108 USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物科学;
  • 关键词

  • 入库时间 2022-08-17 23:50:22

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号