首页> 外文期刊>BMC Bioinformatics >Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data
【24h】

Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data

机译:微阵列数据的可重复性:微阵列质量控制(MAQC)数据的进一步分析

获取原文
           

摘要

Background Many researchers are concerned with the comparability and reliability of microarray gene expression data. Recent completion of the MicroArray Quality Control (MAQC) project provides a unique opportunity to assess reproducibility across multiple sites and the comparability across multiple platforms. The MAQC analysis presented for the conclusion of inter- and intra-platform comparability/reproducibility of microarray gene expression measurements is inadequate. We evaluate the reproducibility/comparability of the MAQC data for 12901 common genes in four titration samples generated from five high-density one-color microarray platforms and the TaqMan technology. We discuss some of the problems with the use of correlation coefficient as metric to evaluate the inter- and intra-platform reproducibility and the percent of overlapping genes (POG) as a measure for evaluation of a gene selection procedure by MAQC. Results A total of 293 arrays were used in the intra- and inter-platform analysis. A hierarchical cluster analysis shows distinct differences in the measured intensities among the five platforms. A number of genes show a small fold-change in one platform and a large fold-change in another platform, even though the correlations between platforms are high. An analysis of variance shows thirty percent of gene expressions of the samples show inconsistent patterns across the five platforms. We illustrated that POG does not reflect the accuracy of a selected gene list. A non-overlapping gene can be truly differentially expressed with a stringent cut, and an overlapping gene can be non-differentially expressed with non-stringent cutoff. In addition, POG is an unusable selection criterion. POG can increase or decrease irregularly as cutoff changes; there is no criterion to determine a cutoff so that POG is optimized. Conclusion Using various statistical methods we demonstrate that there are differences in the intensities measured by different platforms and different sites within platform. Within each platform, the patterns of expression are generally consistent, but there is site-by-site variability. Evaluation of data analysis methods for use in regulatory decision should take no treatment effect into consideration, when there is no treatment effect, "a fold-change cutoff with a non-stringent p-value cutoff" could result in 100% false positive error selection.
机译:背景许多研究人员关注微阵列基因表达数据的可比性和可靠性。 MicroArray质量控制(MAQC)项目的最新完成提供了独特的机会来评估多个站点之间的可重复性以及多个平台之间的可比性。提出的用于总结微阵列基因表达测量结果的平台间和平台内可比性/再现性的MAQC分析是不充分的。我们评估了从五个高密度单色微阵列平台和TaqMan技术生成的四个滴定样品中的12901个常见基因的MAQC数据的可重复性/可比性。我们讨论了使用相关系数作为度量标准来评估平台间和平台内可重复性以及重叠基因百分比(POG)作为通过MAQC评估基因选择程序的一种方法的一些问题。结果总共293个阵列用于平台内和平台间分析。层次聚类分析显示了五个平台之间测量强度的明显差异。尽管平台之间的相关性很高,但许多基因在一个平台上显示出小的折叠变化,而在另一个平台上显示出大的折叠变化。方差分析表明,样本的基因表达的百分之三十在五个平台上均表现出不一致的模式。我们说明了POG不能反映所选基因列表的准确性。非重叠基因可通过严格切割而真正地差异表达,而重叠基因可通过非严格切割而非差异地表达。另外,POG是不可用的选择标准。随着临界值的变化,POG会不规则地增加或减少;没有确定临界值的标准,以便优化POG。结论使用各种统计方法,我们证明了不同平台和平台内不同站点测得的强度存在差异。在每个平台内,表达模式通常是一致的,但每个站点都有可变性。评估用于监管决策的数据分析方法时,不应考虑任何治疗效果,如果没有治疗效果,“倍数变化临界值具有严格的p值临界值”可能会导致100%错误的阳性错误选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号