首页> 美国卫生研究院文献>Biostatistics (Oxford England) >Removing technical variability in RNA-seq data using conditional quantile normalization
【2h】

Removing technical variability in RNA-seq data using conditional quantile normalization

机译:使用条件分位数归一化消除RNA-seq数据中的技术差异

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show that RNA-seq data demonstrate unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find guanine-cytosine content (GC-content) has a strong sample-specific effect on gene expression measurements that, if left uncorrected, leads to false positives in downstream results. We also report on commonly observed data distortions that demonstrate the need for data normalization. Here, we describe a statistical methodology that improves precision by 42% without loss of accuracy. Our resulting conditional quantile normalization algorithm combines robust generalized regression to remove systematic bias introduced by deterministic features such as GC-content and quantile normalization to correct for global distortions.
机译:在全基因组范围内测量基因表达的能力是分子生物学中最有希望的成就之一。微阵列是最早允许这种技术的技术,由于不希望有的可变性来源而出现了很多问题。经过十年的统计方法开发,现在许多这些问题已得到缓解。最近开发的RNA测序(RNA-seq)技术引起了极大的兴奋,部分原因是与微阵列相比,可变性有所降低。但是,我们表明,RNA-seq数据显示出与微阵列中首次观察到的相似的不需要的和模糊的变异性。特别是,我们发现鸟嘌呤胞嘧啶含量(GC含量)对基因表达测量具有很强的样品特异性作用,如果不进行校正,则会导致下游结果出现假阳性。我们还报告了通常观察到的数据失真,这些数据失真说明了数据规范化的必要性。在这里,我们描述了一种统计方法,可以将精度提高42%而不会降低精度。我们产生的条件分位数归一化算法结合了稳健的广义回归,以消除由诸如GC含量和分位数归一化之类的确定性功能引入的系统偏差,以校正全局失真。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号