首页> 美国卫生研究院文献>Nucleic Acids Research >A frameshift error detection algorithm for DNA sequencing projects.
【2h】

A frameshift error detection algorithm for DNA sequencing projects.

机译:DNA测序项目的移码错误检测算法。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

During the determination of DNA sequences, frameshift errors are not the most frequent but they are the most bothersome as they corrupt the amino acid sequence over several residues. Detection of such errors by sequence alignment is only possible when related sequences are found in the databases. To avoid this limitation, we have developed a new tool based on the distribution of non-overlapping 3-tuples or 6-tuples in the three frames of an ORF. The method relies upon the result of a correspondence analysis. It has been extensively tested on Bacillus subtilis and Saccharomyces cerevisiae sequences and has also been examined with human sequences. The results indicate that it can detect frameshift errors affecting as few as 20 bp with a low rate of false positives (no more than 1.0/1000 bp scanned). The proposed algorithm can be used to scan a large collection of data, but it is mainly intended for laboratory practice as a tool for checking the quality of the sequences produced during a sequencing project.
机译:在确定DNA序列的过程中,移码错误不是最常见的,但最麻烦的是,它们会破坏多个残基上的氨基酸序列。仅当在数据库中找到相关序列时,才可以通过序列比对检测此类错误。为避免此限制,我们基于ORF的三个帧中不重叠的3元组或6元组的分布,开发了一种新工具。该方法依赖于对应分析的结果。它已在枯草芽孢杆菌和酿酒酵母序列上进行了广泛测试,并已用人序列进行了检验。结果表明,它可以检测出影响低至20 bp的移码错误,误报率低(扫描不超过1.0 / 1000 bp)。所提出的算法可用于扫描大量数据,但主要用于实验室实践,作为检查测序项目过程中产生的序列质量的工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号