首页> 外文期刊>Bioinformatics >Sensitivity and specificity of five abundance estimators for high-density oligonucleotide microarrays
【24h】

Sensitivity and specificity of five abundance estimators for high-density oligonucleotide microarrays

机译:五个丰度估算器对高密度寡核苷酸微阵列的敏感性和特异性

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: A number of algorithms have been proposed for the processing of feature-level data from high-density oligonucleotide microarrays to give estimates of transcript abundance. Performance in the common task of detecting differential expression between samples can be quantified by the statistical concepts of sensitivity and specificity, and represented by the use of receiver operating characteristic curves. These have been previously presented for small numbers of genes known to be differentially present in spiked-in samples. We present here a study of performance over a large number (thousands) of transcripts for which there is strong evidence of differential expression, with corresponding false positive rates controlled by comparisons between replicates. Results: The straight-line regression analysis of a mixture series with replicates by five estimation algorithms produces a consensus set of 4462 transcripts with differential expression of agreed direction and high significance (p < 0.01) according to all algorithms. The more difficult task of two-sample tests between adjacent mixture levels produces performance curves of fraction true positive detected against significance level. Performance varies significantly between algorithms: at the p < 0.01 level, the detection rate varies between 41 and 66%. A control using comparisons between replicates at the same levels indicates that the tests produce empirical false positive rates closely matching the nominal p-values.
机译:动机:已经提出了许多算法来处理来自高密度寡核苷酸微阵列的特征水平数据,以提供转录本丰度的估计值。可以通过敏感性和特异性的统计概念来量化检测样品之间差异表达的常规任务中的性能,并通过使用接收器工作特性曲线来表示。先前已经针对少量已知在加标样品中差异存在的基因提出了这些方法。我们在此介绍大量(数千)转录本的性能研究,对于这些转录本,有强有力的差异表达证据,相应的假阳性率受复制之间的比较控制。结果:通过五种估计算法对具有重复项的混合序列进行直线回归分析,生成了4462个转录本的共有集,根据所有算法,该表达本具有一致的方向和高度显着性(p <0.01)。在相邻混合物水平之间进行两次样品测试的难度更大的任务会产生相对于显着性水平检测到的真实正分数的性能曲线。算法之间的性能差异很大:在p <0.01的水平上,检测率在41%和66%之间变化。使用相同水平重复样本之间的比较进行的对照表明,测试产生的经验假阳性率与标称p值非常匹配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号