首页> 美国卫生研究院文献>other >Prediction-Oriented Marker Selection (PROMISE): With Application to High-Dimensional Regression
【2h】

Prediction-Oriented Marker Selection (PROMISE): With Application to High-Dimensional Regression

机译:面向预测的标记选择(PROMISE):在高维回归中的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In personalized medicine, biomarkers are used to select therapies with the highest likelihood of success based on an individual patient’s biomarker/genomic profile. Two goals are to choose important biomarkers that accurately predict treatment outcomes and to cull unimportant biomarkers to reduce the cost of biological and clinical verifications. These goals are challenging due to the high dimensionality of genomic data. Variable selection methods based on penalized regression (e.g., the lasso and elastic net) have yielded promising results. However, selecting the right amount of penalization is critical to simultaneously achieving these two goals. Standard approaches based on cross-validation (CV) typically provide high prediction accuracy with high true positive rates but at the cost of too many false positives. Alternatively, stability selection (SS) controls the number of false positives, but at the cost of yielding too few true positives. To circumvent these issues, we propose prediction-oriented marker selection (PROMISE), which combines SS with CV to conflate the advantages of both methods. Our application of PROMISE with the lasso and elastic net in data analysis shows that, compared to CV, PROMISE produces sparse solutions, few false positives, and small type I + type II error, and maintains good prediction accuracy, with a marginal decrease in the true positive rates. Compared to SS, PROMISE offers better prediction accuracy and true positive rates. In summary, PROMISE can be applied in many fields to select regularization parameters when the goals are to minimize false positives and maximize prediction accuracy.
机译:在个性化医学中,生物标志物用于根据个体患者的生物标志物/基因组概况选择成功可能性最高的疗法。两个目标是选择能够准确预测治疗结果的重要生物标志物,并剔除不重要的生物标志物以降低生物学和临床验证的成本。这些目标由于基因组数据的高维度而具有挑战性。基于惩罚回归的变量选择方法(例如套索和弹性网)产生了可喜的结果。但是,选择适当的罚款量对于同时实现这两个目标至关重要。基于交叉验证(CV)的标准方法通常可提供较高的预测精度和较高的真实阳性率,但会带来太多的阳性率。或者,稳定性选择(SS)控制假阳性的数量,但以产生太少的真阳性为代价。为了避免这些问题,我们提出了面向预测的标记选择(PROMISE),它将SS与CV结合起来以融合两种方法的优点。我们在套索和弹性网中使用PROMISE进行数据分析的结果表明,与CV相比,PROMISE产生的稀疏解,假阳性少,I + II型误差小,并保持了良好的预测准确性,而CIM略有下降。真正的积极率。与SS相比,PROMISE提供了更好的预测准确性和真实的阳性率。总而言之,当目标是最大程度地减少误报和最大化预测准确性时,PROMISE可应用于许多领域以选择正则化参数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号