首页> 美国卫生研究院文献>other >Determination of Minimum Training Sample Size for Microarray-Based Cancer Outcome Prediction–An Empirical Assessment
【2h】

Determination of Minimum Training Sample Size for Microarray-Based Cancer Outcome Prediction–An Empirical Assessment

机译:确定基于微阵列的癌症结果预测的最小培训样本量–实证评估

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The promise of microarray technology in providing prediction classifiers for cancer outcome estimation has been confirmed by a number of demonstrable successes. However, the reliability of prediction results relies heavily on the accuracy of statistical parameters involved in classifiers. It cannot be reliably estimated with only a small number of training samples. Therefore, it is of vital importance to determine the minimum number of training samples and to ensure the clinical value of microarrays in cancer outcome prediction. We evaluated the impact of training sample size on model performance extensively based on 3 large-scale cancer microarray datasets provided by the second phase of MicroArray Quality Control project (MAQC-II). An SSNR-based (scale of signal-to-noise ratio) protocol was proposed in this study for minimum training sample size determination. External validation results based on another 3 cancer datasets confirmed that the SSNR-based approach could not only determine the minimum number of training samples efficiently, but also provide a valuable strategy for estimating the underlying performance of classifiers in advance. Once translated into clinical routine applications, the SSNR-based protocol would provide great convenience in microarray-based cancer outcome prediction in improving classifier reliability.
机译:许多可证明的成功证实了微阵列技术在提供预测癌症结果的预测分类器方面的前景。但是,预测结果的可靠性在很大程度上取决于分类器中涉及的统计参数的准确性。仅使用少量训练样本就不能可靠地估计它。因此,确定培训样本的最少数量并确保微阵列在癌症结果预测中的临床价值至关重要。我们基于MicroArray质量控制项目(MAQC-II)第二阶段提供的3个大规模癌症微阵列数据集,广泛评估了训练样本量对模型性能的影响。在本研究中,提出了一种基于SSNR(信噪比的标度)的协议,用于确定最小训练样本大小。基于另外3个癌症数据集的外部验证结果证实,基于SSNR的方法不仅可以有效地确定最小数量的训练样本,而且还可以提供一种有价值的策略,用于提前估计分类器的基本性能。一旦转化为临床常规应用,基于SSNR的协议将为基于微阵列的癌症结果预测提供极大的便利,从而提高了分类器的可靠性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号