首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >ResSeq: Enhancing Short-Read Sequencing Alignment By Rescuing Error-Containing Reads
【24h】

ResSeq: Enhancing Short-Read Sequencing Alignment By Rescuing Error-Containing Reads

机译:ResSeq:通过挽救含错误的读段来增强短读序列比对

获取原文
获取原文并翻译 | 示例
       

摘要

Next-generation short-read sequencing is widely utilized in genomic studies. Biological applications require an alignment step to map sequencing reads to the reference genome, before acquiring expected genomic information. This requirement makes alignment accuracy a key factor for effective biological interpretation. Normally, when accounting for measurement errors and single nucleotide polymorphisms, short read mappings with a few mismatches are generally considered acceptable. However, to further improve the efficiency of short-read sequencing alignment, we propose a method to retrieve additional reliably aligned reads (reads with more than a pre-defined number of mismatches), using a Bayesian-based approach. In this method, we first retrieve the sequence context around the mismatched nucleotides within the already aligned reads; these loci contain the genomic features where sequencing errors occur. Then, using the derived pattern, we evaluate the remaining (typically discarded) reads with more than the allowed number of mismatches, and calculate a score that represents the probability that a specific alignment is correct. This strategy allows the extraction of more reliably aligned reads, therefore improving alignment sensitivity. Implementation: The source code of our tool, ResSeq, can be downloaded from: https://github.com/hrbeubiocenter/Resseq.
机译:下一代短读测序已广泛用于基因组研究中。在获取预期的基因组信息之前,生物学应用需要一个比对步骤来将测序读数映射到参考基因组。该要求使对准精度成为有效生物学解释的关键因素。通常,当考虑到测量误差和单核苷酸多态性时,通常认为具有少量错配的短读映射是可接受的。但是,为了进一步提高短读序列比对的效率,我们提出了一种使用基于贝叶斯的方法来检索其他可靠比对的读码(错配数量超过预定义数量的读码)的方法。在这种方法中,我们首先在已经比对的读段内检索错配核苷酸周围的序列上下文。这些基因座包含发生测序错误的基因组特征。然后,使用派生的模式,我们以不匹配的允许数量评估剩余的(通常丢弃的)读段,并计算一个分数,该分数代表特定比对正确的可能性。该策略允许提取更可靠的比对读数,从而提高比对灵敏度。实施:我们的工具ResSeq的源代码可以从以下网址下载:https://github.com/hrbeubiocenter/Resseq。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号