首页> 外文会议>IEEE-EMBS International Conference on Biomedical and Health Informatics >Evaluating the impact of sequencing error correction for RNA-seq data with ERCC RNA spike-in controls
【24h】

Evaluating the impact of sequencing error correction for RNA-seq data with ERCC RNA spike-in controls

机译:使用ERCC RNA插入对照评估RNA-seq数据的测序错误校正的影响

获取原文

摘要

Sequencing errors are a major issue for several next-generation sequencing-based applications such as de novo assembly and single nucleotide polymorphism detection. Several error-correction methods have been developed to improve raw data quality. However, error-correction performance is hard to evaluate because of the lack of a ground truth. In this study, we propose a novel approach which using ERCC RNA spike-in controls as the ground truth to facilitate error-correction performance evaluation. After aligning raw and corrected RNA-seq data, we characterized the quality of reads by three metrics: mismatch patterns (i.e., the substitution rate of A to C) of reads aligned with one mismatch, mismatch patterns of reads aligned with two mismatches and the percentage increase of reads aligned to reference. We observed that the mismatch patterns for reads aligned with one mismatch are significantly correlated between ERCC spike-ins and real RNA samples. Based on such observations, we conclude that ERCC spike-ins can serve as ground truths for error correction beyond their previous applications for validation of dynamic range and fold-change response. Also, the mismatch patterns for ERCC reads aligned with one mismatch can serve as a novel and reliable metric to evaluate the performance of error-correction tools.
机译:测序错误是一些基于新一代测序的应用程序的主要问题,例如从头组装和单核苷酸多态性检测。已经开发了几种纠错方法来改善原始数据质量。但是,由于缺乏基本事实,因此很难评估纠错性能。在这项研究中,我们提出了一种新方法,该方法使用ERCC RNA刺入对照作为基本事实,以促进错误校正性能评估。在对原始和校正后的RNA-seq数据进行比对后,我们通过三个指标对读取的质量进行了表征:与一个错配对齐的读数的错配模式(即,A到C的取代率),与两个错配对齐的读数的错配模式以及与参考对齐的读数的百分比增加。我们观察到,与一个错配对齐的读数的错配模式与ERCC突入和真实RNA样品之间显着相关。基于这样的观察,我们得出结论,ERCC尖峰插入可以作为错误校正的基础,超出了他们先前用于验证动态范围和倍数变化响应的应用程序。同样,与一个不匹配对齐的ERCC读取的不匹配模式可以用作评估错误校正工具性能的新颖而可靠的指标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号