首页> 外文期刊>Nucleic acids research >Sequence errors described in GenBank: a means to determine the accuracy of DNA sequence interpretation
【24h】

Sequence errors described in GenBank: a means to determine the accuracy of DNA sequence interpretation

机译:GenBank中描述的序列错误:一种确定DNA序列解释准确性的方法

获取原文
           

摘要

The accuracy of nucleic acid sequence data interpretation was determined by assessing and quantifying the discrepancies reported in the GenBank database. This permitted the calculation of an Error Rate (ER) for nucleic acid sequence determination. If one assumes that most entries (TB, Total Bases) were independently verified or those without reported discrepancies were correct, the ER is 0.368 errors per 1000 bases. However, if one assumes that only those sequences with reported discrepancies (TBIQ, Total Bases from entries In Question) were verified and are thus correct, the ER is 2.887 errors per 1000 bases. This establishes the first set of limit boundaries of the ER for sequence interpretation and sequence errors within the GenBank database and provides the foundation for future assessments and the monitoring of sequence data accumulation. In addition, the ER measure provides a basis to evaluate the efficiency and merit of present and future automated nucleic acid sequencing technologies which will have a direct impact upon the final outcome of the “Human Genome Initiative”.
机译:核酸序列数据解释的准确性是通过评估和量化GenBank数据库中报告的差异来确定的。这允许计算用于核酸序列确定的错误率(ER)。如果假设大多数条目(TB,总碱基)已独立验证或没有报告差异的条目是正确的,则ER为每1000个碱基0.368个错误。但是,如果假设只有那些报告差异的序列(TBIQ,有问题的条目的总碱基)是正确的,则ER为每1000个碱基2.887个错误。这为GenBank数据库中的序列解释和序列错误建立了ER的第一组极限边界,并为将来评估和监测序列数据积累提供了基础。此外,ER措施为评估当前和未来自动核酸测序技术的效率和优点提供了基础,这些技术将直接影响“人类基因组计划”的最终结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号