首页> 美国卫生研究院文献>Nucleic Acids Research >Sequence errors described in GenBank: a means to determine the accuracy of DNA sequence interpretation.
【2h】

Sequence errors described in GenBank: a means to determine the accuracy of DNA sequence interpretation.

机译:GenBank中描述的序列错误:一种确定DNA序列解释准确性的方法。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The accuracy of nucleic acid sequence data interpretation was determined by assessing and quantifying the discrepancies reported in the GenBank database. This permitted the calculation of an Error Rate (ER) for nucleic acid sequence determination. If one assumes that most entries (TB, Total Bases) were independently verified or those without reported discrepancies were correct, the ER is 0.368 errors per 1000 bases. However, if one assumes that only those sequences with reported discrepancies (TBIQ, Total Bases from entries In Question) were verified and are thus correct, the ER is 2.887 errors per 1000 bases. This establishes the first set of limit boundaries of the ER for sequence interpretation and sequence errors within the GenBank database and provides the foundation for future assessments and the monitoring of sequence data accumulation. In addition, the ER measure provides a basis to evaluate the efficiency and merit of present and future automated nucleic acid sequencing technologies which will have a direct impact upon the final outcome of the "Human Genome Initiative".
机译:核酸序列数据解释的准确性是通过评估和量化GenBank数据库中报告的差异来确定的。这允许计算用于核酸序列确定的错误率(ER)。如果假设大多数条目(TB,总碱基)已被独立验证或没有报告差异的条目是正确的,则ER为每1000个碱基0.368个错误。但是,如果假设只有那些报告差异的序列(TBIQ,有问题的条目的总碱基)被验证并且是正确的,则ER为每1000个碱基2.887个错误。这为GenBank数据库中的序列解释和序列错误建立了ER的第一组极限边界,并为将来的评估和监测序列数据的积累提供了基础。此外,ER措施为评估当前和未来自动核酸测序技术的效率和优点提供了基础,这些技术将直接影响“人类基因组计划”的最终结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号