首页> 外文期刊>Genetics Selection Evolution >Analysis of a simulated microarray dataset: Comparison of methods for data normalisation and detection of differential expression (Open Access publication)
【24h】

Analysis of a simulated microarray dataset: Comparison of methods for data normalisation and detection of differential expression (Open Access publication)

机译:模拟微阵列数据集的分析:数据归一化和差异表达检测方法的比较(开放获取出版物)

获取原文
获取原文并翻译 | 示例
       

摘要

Microarrays allow researchers to measure the expression of thousands of genes in a single experiment. Before statistical comparisons can be made, the data must be assessed for quality and normalisation procedures must be applied, of which many have been proposed. Methods of comparing the normalised data are also abundant, and no clear consensus has yet been reached. The purpose of this paper was to compare those methods used by the EADGENE network on a very noisy simulated data set. With the a priori knowledge of which genes are differentially expressed, it is possible to compare the success of each approach quantitatively. Use of an intensity-dependent normalisation procedure was common, as was correction for multiple testing. Most variety in performance resulted from differing approaches to data quality and the use of different statistical tests. Very few of the methods used any kind of background correction. A number of approaches achieved a success rate of 95% or above, with relatively small numbers of false positives and negatives. Applying stringent spot selection criteria and elimination of data did not improve the false positive rate and greatly increased the false negative rate. However, most approaches performed well, and it is encouraging that widely available techniques can achieve such good results on a very noisy data set.
机译:微阵列使研究人员可以在单个实验中测量数千种基因的表达。在进行统计比较之前,必须对数据进行质量评估和标准化程序,其中许多已经提出。比较标准化数据的方法也很丰富,并且尚未达成明确共识。本文的目的是在非常嘈杂的模拟数据集上比较EADGENE网络使用的那些方法。在了解哪些基因差异表达的先验知识之后,有可能定量地比较每种方法的成功。通常使用强度相关的归一化程序,以及多次测试的校正。性能上的大多数差异是由于采用了不同的数据质量方法以及使用了不同的统计测试所致。很少有方法使用任何类型的背景校正。许多方法的成功率达到95%或以上,假阳性和阴性的数量相对较少。应用严格的斑点选择标准和消除数据并不能提高假阳性率,而大大提高了假阴性率。但是,大多数方法都执行良好,令人鼓舞的是,广泛使用的技术可以在非常嘈杂的数据集上取得如此好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号