首页> 美国卫生研究院文献>Elsevier Sponsored Documents >Predictive modelling using neuroimaging data in the presence of confounds
【2h】

Predictive modelling using neuroimaging data in the presence of confounds

机译:在存在混淆的情况下使用神经影像数据进行预测建模

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

When training predictive models from neuroimaging data, we typically have available non-imaging variables such as age and gender that affect the imaging data but which we may be uninterested in from a clinical perspective. Such variables are commonly referred to as ‘confounds’. In this work, we firstly give a working definition for confound in the context of training predictive models from samples of neuroimaging data. We define a confound as a variable which affects the imaging data and has an association with the target variable in the sample that differs from that in the population-of-interest, i.e., the population over which we intend to apply the estimated predictive model. The focus of this paper is the scenario in which the confound and target variable are independent in the population-of-interest, but the training sample is biased due to a sample association between the target and confound. We then discuss standard approaches for dealing with confounds in predictive modelling such as image adjustment and including the confound as a predictor, before deriving and motivating an Instance Weighting scheme that attempts to account for confounds by focusing model training so that it is optimal for the population-of-interest. We evaluate the standard approaches and Instance Weighting in two regression problems with neuroimaging data in which we train models in the presence of confounding, and predict samples that are representative of the population-of-interest. For comparison, these models are also evaluated when there is no confounding present. In the first experiment we predict the MMSE score using structural MRI from the ADNI database with gender as the confound, while in the second we predict age using structural MRI from the IXI database with acquisition site as the confound. Considered over both datasets we find that none of the methods for dealing with confounding gives more accurate predictions than a baseline model which ignores confounding, although including the confound as a predictor gives models that are less accurate than the baseline model. We do find, however, that different methods appear to focus their predictions on specific subsets of the population-of-interest, and that predictive accuracy is greater when there is no confounding present. We conclude with a discussion comparing the advantages and disadvantages of each approach, and the implications of our evaluation for building predictive models that can be used in clinical practice.
机译:当根据神经影像数据训练预测模型时,我们通常会使用诸如年龄和性别之类的非影像变量,这些变量会影响影像数据,但从临床角度来看,我们可能对此并不感兴趣。此类变量通常称为“混杂”。在这项工作中,我们首先给出在训练来自神经影像数据样本的预测模型的过程中混淆的工作定义。我们将混淆定义为一个变量,该变量会影响成像数据并与样本中的目标变量相关联,该目标变量与目标人群(即我们打算在其上应用估计的预测模型的人群)中的目标变量有所不同。本文的重点是在目标人群中混杂变量和目标变量是独立的情况,但是训练样本由于目标变量和混杂变量之间的样本关联而有偏差。然后,我们将讨论在预测模型(例如图像调整)中处理混杂问题的标准方法,并在将混杂对象作为预测变量之前进行处理,然后通过实例模型加权方案来推导和激励实例加权方案,以通过集中模型训练来解决混杂问题,从而使其适合人群-出于兴趣。我们在神经影像数据的两个回归问题中评估标准方法和实例权重,其中我们在混淆的情况下训练模型,并预测代表目标人群的样本。为了进行比较,当不存在混淆时也会评估这些模型。在第一个实验中,我们使用ADNI数据库中的结构性MRI来预测MMSE得分,而性别则是混淆的;而在第二个实验中,我们使用IXI数据库中的结构性MRI来预测年龄,以性别作为混淆点。考虑到这两个数据集,我们发现处理混杂的方法没有一个比忽略混杂的基线模型更准确的预测,尽管将混杂作为预测变量包括在内,所提供的模型比基线模型更不准确。但是,我们确实发现,不同的方法似乎将其预测集中在目标人群的特定子集上,并且当不存在混淆时,预测准确性更高。最后,我们进行了讨论,比较了每种方法的优缺点,以及我们的评估对建立可在临床实践中使用的预测模型的意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号