首页> 外文期刊>BMC Bioinformatics >Tilting the lasso by knowledge-based post-processing
【24h】

Tilting the lasso by knowledge-based post-processing

机译:通过基于知识的后处理倾斜套索

获取原文
       

摘要

Background It is useful to incorporate biological knowledge on the role of genetic determinants in predicting an outcome. It is, however, not always feasible to fully elicit this information when the number of determinants is large. We present an approach to overcome this difficulty. First, using half of the available data, a shortlist of potentially interesting determinants are generated. Second, binary indications of biological importance are elicited for this much smaller number of determinants. Third, an analysis is carried out on this shortlist using the second half of the data. Results We show through simulations that, compared with adaptive lasso, this approach leads to models containing more biologically relevant variables, while the prediction mean squared error (PMSE) is comparable or even reduced. We also apply our approach to bone mineral density data, and again final models contain more biologically relevant variables and have reduced PMSEs. Conclusion Our method leads to comparable or improved predictive performance, and models with greater face validity and interpretability with feasible incorporation of biological knowledge into predictive models.
机译:背景技术将有关遗传决定因素在预测结果中作用的生物学知识整合在一起是有用的。但是,当行列式的数量很大时,并非总是可行的。我们提出一种克服这一困难的方法。首先,使用一半的可用数据,生成可能有趣的行列式的候选清单。第二,对于数量少得多的决定因素,产生了生物学重要性的二进制指示。第三,使用数据的后半部分对此入围列表进行分析。结果我们通过仿真显示,与自适应套索相比,该方法可导致包含更多生物学相关变量的模型,而预测均方误差(PMSE)可比甚至更低。我们还将我们的方法应用于骨矿物质密度数据,并且最终模型再次包含更多与生物学相关的变量,并降低了PMSE。结论我们的方法可导致可比或改进的预测性能,并且具有将面孔知识和可解释性以及将生物学知识切实可行地纳入预测模型的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号