首页> 外文OA文献 >Summarizing and correcting the GC content bias in high-throughput sequencing
【2h】

Summarizing and correcting the GC content bias in high-throughput sequencing

机译:总结和纠正高通量测序中GC含量偏差

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

GC content bias describes the dependence between fragment count (read coverage) and GC content found in Illumina sequencing data. This bias can dominate the signal of interest for analyses that focus on measuring fragment abundance within a genome, such as copy number estimation (DNA-seq). The bias is not consistent between samples; and there is no consensus as to the best methods to remove it in a single sample. We analyze regularities in the GC bias patterns, and find a compact description for this unimodal curve family. It is the GC content of the full DNA fragment, not only the sequenced read, that most influences fragment count. This GC effect is unimodal: both GC-rich fragments and AT-rich fragments are underrepresented in the sequencing results. This empirical evidence strengthens the hypothesis that PCR is the most important cause of the GC bias. We propose a model that produces predictions at the base pair level, allowing strand-specific GC-effect correction regardless of the downstream smoothing or binning. These GC modeling considerations can inform other high-throughput sequencing analyses such as ChIP-seq and RNA-seq.
机译:GC含量偏差描述了片段计数(读取覆盖率)与Illumina测序数据中发现的GC含量之间的相关性。这种偏倚可以控制感兴趣的信号,从而集中进行测量基因组中片段丰度的分析,例如拷贝数估计(DNA-seq)。样本之间的偏差不一致。对于在单个样品中去除它的最佳方法尚无共识。我们分析了GC偏差模式中的规律性,并为该单峰曲线系列找到了简洁的描述。完整DNA片段的GC含量(不仅是测序读数),对片段计数的影响最大。这种GC效应是单峰的:富含GC的片段和富含AT的片段在测序结果中均不足。这一经验证据进一步证实了PCR是GC偏差最重要原因的假说。我们提出了一个可在碱基对水平上产生预测的模型,无论下游平滑或装仓如何,都可进行链特异性GC效应校正。这些GC建模考虑因素可以为其他高通量测序分析提供参考,例如ChIP-seq和RNA-seq。

著录项

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号