首页> 外文期刊>BMC Medical Imaging >Dimensionality reduction-based fusion approaches for imaging and non-imaging biomedical data: concepts, workflow, and use-cases
【24h】

Dimensionality reduction-based fusion approaches for imaging and non-imaging biomedical data: concepts, workflow, and use-cases

机译:基于降维的成像和非成像生物医学数据融合方法:概念,工作流和用例

获取原文
       

摘要

Background With a wide array of multi-modal, multi-protocol, and multi-scale biomedical data being routinely acquired for disease characterization, there is a pressing need for quantitative tools to combine these varied channels of information. The goal of these integrated predictors is to combine these varied sources of information, while improving on the predictive ability of any individual modality. A number of application-specific data fusion methods have been previously proposed in the literature which have attempted to reconcile the differences in dimensionalities and length scales across different modalities. Our objective in this paper was to help identify metholodological choices that need to be made in order to build a data fusion technique, as it is not always clear which strategy is optimal for a particular problem. As a comprehensive review of all possible data fusion methods was outside the scope of this paper, we have focused on fusion approaches that employ dimensionality reduction (DR). Methods In this work, we quantitatively evaluate 4 non-overlapping existing instantiations of DR-based data fusion, within 3 different biomedical applications comprising over 100 studies. These instantiations utilized different knowledge representation and knowledge fusion methods, allowing us to examine the interplay of these modules in the context of data fusion. The use cases considered in this work involve the integration of (a) radiomics features from T2w MRI with peak area features from MR spectroscopy for identification of prostate cancer in vivo, (b) histomorphometric features (quantitative features extracted from histopathology) with protein mass spectrometry features for predicting 5 year biochemical recurrence in prostate cancer patients, and (c) volumetric measurements on T1w MRI with protein expression features to discriminate between patients with and without Alzheimers’ Disease. Results and conclusions Our preliminary results in these specific use cases indicated that the use of kernel representations in conjunction with DR-based fusion may be most effective, as a weighted multi-kernel-based DR approach resulted in the highest area under the ROC curve of over 0.8. By contrast non-optimized DR-based representation and fusion methods yielded the worst predictive performance across all 3 applications. Our results suggest that when the individual modalities demonstrate relatively poor discriminability, many of the data fusion methods may not yield accurate, discriminatory representations either. In summary, to outperform the predictive ability of individual modalities, methodological choices for data fusion must explicitly account for the sparsity of and noise in the feature space.
机译:背景技术随着常规采集用于疾病表征的各种各样的多模式,多协议和多尺度的生物医学数据,迫切需要定量工具来组合这些变化的信息渠道。这些综合预测器的目标是结合这些不同的信息源,同时提高任何个体模式的预测能力。先前已经在文献中提出了许多专用数据融合方法,这些方法试图调和跨不同模态的尺寸和长度尺度上的差异。本文的目的是帮助确定为建立数据融合技术而需要进行的方法学选择,因为对于特定的问题并不总是哪种策略是最佳的。由于对所有可能的数据融合方法的全面综述不在本文讨论范围之内,因此我们将重点放在采用降维(DR)的融合方法上。方法在这项工作中,我们定量评估了3种不同的生物医学应用(包括100项研究)中基于DR的数据融合的4种不重叠的现有实例。这些实例利用不同的知识表示和知识融合方法,使我们能够在数据融合的背景下检查这些模块的相互作用。在这项工作中考虑的用例涉及(a)将T2w MRI的放射学特征与MR光谱学的峰面积特征相结合,以在体内鉴定前列腺癌;(b)将组织形态特征(从组织病理学中提取的定量特征)与蛋白质质谱相结合预测前列腺癌患者5年生化复发的特征,以及(c)具有蛋白质表达特征的T1w MRI的体积测量,以区分患有和不患有阿尔茨海默氏病的患者。结果和结论我们在这些特定用例中的初步结果表明,结合基于DR的融合使用内核表示可能是最有效的,因为基于多核的加权DR方法在ROC曲线下的面积最大。超过0.8。相比之下,未经优化的基于DR的表示和融合方法在所有3个应用程序中的预测性能均最差。我们的结果表明,当单个模式显示出相对较差的可辨别性时,许多数据融合方法也可能无法产生准确的,有区别的表示。总而言之,要优于单个模式的预测能力,数据融合的方法选择必须明确考虑特征空间的稀疏性和噪声。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号