首页> 外文期刊>Metabolites >A Multilevel Bayesian Approach to Improve Effect Size Estimation in Regression Modeling of Metabolomics Data Utilizing Imputation with Uncertainty
【24h】

A Multilevel Bayesian Approach to Improve Effect Size Estimation in Regression Modeling of Metabolomics Data Utilizing Imputation with Uncertainty

机译:一种多级贝叶斯方法,提高代谢组族数据回归建模效应估计利用不确定性

获取原文
           

摘要

To ensure scientific reproducibility of metabolomics data, alternative statistical methods are needed. A paradigm shift away from the p -value toward an embracement of uncertainty and interval estimation of a metabolite’s true effect size may lead to improved study design and greater reproducibility. Multilevel Bayesian models are one approach that offer the added opportunity of incorporating imputed value uncertainty when missing data are present. We designed simulations of metabolomics data to compare multilevel Bayesian models to standard logistic regression with corrections for multiple hypothesis testing. Our simulations altered the sample size and the fraction of significant metabolites truly different between two outcome groups. We then introduced missingness to further assess model performance. Across simulations, the multilevel Bayesian approach more accurately estimated the effect size of metabolites that were significantly different between groups. Bayesian models also had greater power and mitigated the false discovery rate. In the presence of increased missing data, Bayesian models were able to accurately impute the true concentration and incorporating the uncertainty of these estimates improved overall prediction. In summary, our simulations demonstrate that a multilevel Bayesian approach accurately quantifies the estimated effect size of metabolite predictors in regression modeling, particularly in the presence of missing data.
机译:为确保代谢组数据的科学再现性,需要替代统计方法。远离P-value朝着拥抱的value偏离p-value的差异和间隔估计代谢物的真实效果大小可能导致改善的研究设计和更高的再现性。多级贝叶斯模型是一种方法,可以在存在缺失数据时纳入普通值不确定性的增加的机会。我们设计了对代谢组合数据的模拟,将多级贝叶斯模型与多个假设检测进行校正进行标准逻辑回归。我们的模拟改变了样品尺寸,并且在两种结果组之间真正不同的显着代谢物的分数。然后我们引入了失踪,以进一步评估模型性能。在模拟中,多级贝叶斯方法更准确地估计在群体之间具有显着差异的代谢物的效果大小。贝叶斯模型也有更大的力量并减轻了虚假的发现率。在增加缺失数据的情况下,贝叶斯模型能够准确地赋予真正的浓度,并结合这些估计的不确定性改善了整体预测。总之,我们的模拟表明,多级贝叶斯方法准确地量化了回归建模中代谢物预测器的估计效果大小,特别是在存在缺失数据的情况下。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号