首页> 外国专利> METHOD AND DEVICE FOR IDENTIFYING EXTENSION CONFLICT AND DETERMINING CONFIDENCE LEVEL OF SEED READ IN NUCLEOTIDE SEQUENCE ASSEMBLY

METHOD AND DEVICE FOR IDENTIFYING EXTENSION CONFLICT AND DETERMINING CONFIDENCE LEVEL OF SEED READ IN NUCLEOTIDE SEQUENCE ASSEMBLY

机译:鉴定核苷酸序列装配中种子阅读的延伸冲突和确定可信度的方法和装置

摘要

Disclosed is a method for identifying extension conflict and determining a confidence level of a seed read in nucleotide sequence assembly. The method comprises: selecting, from reads for gap closure, all reads that overlap one end of a first contig close to a gap and taking the all reads as a gap closure read set, and selecting, from the gap closure read set, a read having the shortest overlap as a seed read; determining whether the gap closure read set has a read having the length of an overlap with the first contig being shorter than the length of an overlap between the seed read and the first contig, and whether the gap closure read set has a read that does not overlap the seed read; if any one of the two determination results is yes, indicating that extension conflict occurs, and determine that the seed read is inconvincible; reselecting a convincible seed read, and splicing the seed read and the first contig, so as to perform the gap closure. Further disclosed is an apparatus for identifying extension conflict and determining a confidence level of a seed read in nucleotide sequence assembly.
机译:公开了一种用于识别延伸冲突并确定核苷酸序列装配中读取的种子的置信度的方法。该方法包括:从用于间隙闭合的读段中选择与第一重叠群的靠近间隙的末端重叠的所有读段,并将所有读段作为间隙闭合读段,并从间隙闭合读段中选择一个读段。作为种子阅读,重叠时间最短;确定所述间隙封闭阅读集是否具有与所述第一重叠群重叠的长度短于所述种子阅读与所述第一重叠群之间的重叠长度的阅读,以及所述间隙封闭读集合是否具有不具有所述第一重叠群的重叠长度的阅读。重叠读取的种子;如果两个确定结果中的任何一个是,则表明发生了扩展冲突,并确定种子读取是不可信的;重新选择易读的种子阅读,并拼接种子阅读和第一个重叠群,以进行缺口闭合。还公开了一种用于识别延伸冲突并确定核苷酸序列装配中读取的种子的置信度的设备。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号