首页> 美国卫生研究院文献>other >Integration of High-Volume Molecular and Imaging Data for Composite Biomarker Discovery in the Study of Melanoma
【2h】

Integration of High-Volume Molecular and Imaging Data for Composite Biomarker Discovery in the Study of Melanoma

机译:大量的分子和成像数据的整合用于黑色素瘤研究中复合生物标志物的发现

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this work the effects of simple imputations are studied, regarding the integration of multimodal data originating from different patients. Two separate datasets of cutaneous melanoma are used, an image analysis (dermoscopy) dataset together with a transcriptomic one, specifically DNA microarrays. Each modality is related to a different set of patients, and four imputation methods are employed to the formation of a unified, integrative dataset. The application of backward selection together with ensemble classifiers (random forests), followed by principal components analysis and linear discriminant analysis, illustrates the implication of the imputations on feature selection and dimensionality reduction methods. The results suggest that the expansion of the feature space through the data integration, achieved by the exploitation of imputation schemes in general, aids the classification task, imparting stability as regards the derivation of putative classifiers. In particular, although the biased imputation methods increase significantly the predictive performance and the class discrimination of the datasets, they still contribute to the study of prominent features and their relations. The fusion of separate datasets, which provide a multimodal description of the same pathology, represents an innovative, promising avenue, enhancing robust composite biomarker derivation and promoting the interpretation of the biomedical problem studied.
机译:在这项工作中,关于来自不同患者的多峰数据的整合,研究了简单归因的效果。使用皮肤黑素瘤的两个独立数据集,即图像分析(皮肤镜)数据集和转录组数据集,尤其是DNA微阵列。每种方式都与一组不同的患者相关,并且采用四种归因方法来形成一个统一的集成数据集。向后选择与集成分类器(随机森林)的应用,然后进行主成分分析和线性判别分析,说明了归因于特征选择和降维方法的含义。结果表明,一般而言,通过利用归因方案来实现通过数据集成来扩展特征空间,有助于分类任务,并在推定分类器的推导方面赋予稳定性。特别是,尽管有偏推算方法大大提高了数据集的预测性能和分类判别能力,但它们仍有助于研究突出特征及其关系。单独的数据集的融合提供了相同病理的多模式描述,代表了一种创新的,有希望的途径,增强了可靠的复合生物标志物的推导并促进了对所研究生物医学问题的解释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号