首页> 美国卫生研究院文献>BMC Bioinformatics >Controlling false discoveries in high-dimensional situations: boosting with stability selection
【2h】

Controlling false discoveries in high-dimensional situations: boosting with stability selection

机译:在高维情况下控制错误发现:通过稳定性选择来增强

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundModern biotechnologies often result in high-dimensional data sets with many more variables than observations (n≪p). These data sets pose new challenges to statistical analysis: Variable selection becomes one of the most important tasks in this setting. Similar challenges arise if in modern data sets from observational studies, e.g., in ecology, where flexible, non-linear models are fitted to high-dimensional data. We assess the recently proposed flexible framework for variable selection called stability selection. By the use of resampling procedures, stability selection adds a finite sample error control to high-dimensional variable selection procedures such as Lasso or boosting. We consider the combination of boosting and stability selection and present results from a detailed simulation study that provide insights into the usefulness of this combination. The interpretation of the used error bounds is elaborated and insights for practical data analysis are given.
机译:背景技术现代生物技术通常会导致高维数据集,其变量多于观测值(n≪p)。这些数据集给统计分析带来了新的挑战:变量选择成为此设置中最重要的任务之一。如果在来自观察研究的现代数据集中,例如在生态学中,将灵活的非线性模型拟合到高维数据中,则会遇到类似的挑战。我们评估了最近提出的用于变量选择的灵活框架,称为稳定性选择。通过使用重采样过程,稳定性选择将有限采样误差控制添加到高维度变量选择过程(例如套索或增强)。我们考虑了增强和稳定性选择的组合,并给出了详细的仿真研究结果,这些结果提供了对该组合有用性的见解。详细说明了所使用的误差范围,并给出了实用数据分析的见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号