首页> 美国卫生研究院文献>Scientific Reports >A Novel Statistical Method to Diagnose Quantify and Correct Batch Effects in Genomic Studies
【2h】

A Novel Statistical Method to Diagnose Quantify and Correct Batch Effects in Genomic Studies

机译:诊断定量和校正基因组研究中批次效应的新型统计方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Genome projects now generate large-scale data often produced at various time points by different laboratories using multiple platforms. This increases the potential for batch effects. Currently there are several batch evaluation methods like principal component analysis (PCA; mostly based on visual inspection), and sometimes they fail to reveal all of the underlying batch effects. These methods can also lead to the risk of unintentionally correcting biologically interesting factors attributed to batch effects. Here we propose a novel statistical method, finding batch effect (findBATCH), to evaluate batch effect based on probabilistic principal component and covariates analysis (PPCCA). The same framework also provides a new approach to batch correction, correcting batch effect (correctBATCH), which we have shown to be a better approach to traditional PCA-based correction. We demonstrate the utility of these methods using two different examples (breast and colorectal cancers) by merging gene expression data from different studies after diagnosing and correcting for batch effects and retaining the biological effects. These methods, along with conventional visual inspection-based PCA, are available as a part of an R package exploring batch effect (exploBATCH; ).
机译:现在,基因组项目会生成通常由不同实验室在多个时间点使用多个平台生成的大规模数据。这增加了批量效应的可能性。当前有几种批处理评估方法,例如主成分分析(PCA;主要基于视觉检查),有时它们无法揭示所有潜在的批处理效果。这些方法也可能导致意外纠正归因于批次效应的生物学上有趣的因素的风险。在这里,我们提出了一种新的统计方法,即寻找批次效应(findBATCH),以基于概率主成分和协变量分析(PPCCA)评估批次效应。相同的框架还提供了一种批处理校正的新方法,即校正批处理效果(correctBATCH),我们已证明这是对基于PCA的传统校正的更好方法。我们通过诊断和纠正批次效应并保留生物学效应后,通过合并来自不同研究的基因表达数据,使用两个不同的例子(乳腺癌和结肠直肠癌)证明了这些方法的实用性。这些方法与常规的基于视觉检查的PCA一起,是探索批处理效果(exploBATCH;)的R包的一部分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号