...
首页> 外文期刊>BioMed research international >Integration of High-Volume Molecular and Imaging Data for Composite Biomarker Discovery in the Study of Melanoma
【24h】

Integration of High-Volume Molecular and Imaging Data for Composite Biomarker Discovery in the Study of Melanoma

机译:大量的分子和成像数据的整合,用于黑色素瘤研究中复合生物标志物的发现

获取原文
获取原文并翻译 | 示例
           

摘要

In this work the effects of simple imputations are studied, regarding the integration of multimodal data originating from different patients. Two separate datasets of cutaneous melanoma are used, an image analysis (dermoscopy) dataset together with a transcriptomic one, specifically DNA microarrays. Each modality is related to a different set of patients, and four imputation methods are employed to the formation of a unified, integrative dataset. The application of backward selection together with ensemble classifiers (random forests), followed by principal components analysis and linear discriminant analysis, illustrates the implication of the imputations on feature selection and dimensionality reduction methods. The results suggest that the expansion of the feature space through the data integration, achieved by the exploitation of imputation schemes in general, aids the classification task, imparting stability as regards the derivation of putative classifiers. In particular, although the biased imputation methods increase significantly the predictive performance and the class discrimination of the datasets, they still contribute to the study of prominent features and their relations. The fusion of separate datasets, which provide a multimodal description of the same pathology, represents an innovative, promising avenue, enhancing robust composite biomarker derivation and promoting the interpretation of the biomedical problem studied.
机译:在这项工作中,关于源自不同患者的多峰数据的整合,研究了简单归因的效果。使用皮肤黑素瘤的两个独立数据集,即图像分析(皮肤镜)数据集和转录组数据集,特别是DNA微阵列。每种方式都与一组不同的患者相关,并且采用了四种插补方法来形成统一的集成数据集。向后选择与集成分类器(随机森林)的结合使用,然后进行主成分分析和线性判别分析,说明了归因于特征选择和降维方法的含义。结果表明,一般情况下,通过利用归因方案来实现数据集成来扩展特征空间,有助于分类任务,并在推定分类器的推导上赋予稳定性。特别是,尽管有偏推算方法大大提高了数据集的预测性能和分类判别能力,但它们仍有助于研究突出特征及其关系。单独数据集的融合提供了相同病理的多模式描述,代表了一种创新的,有希望的途径,增强了可靠的复合生物标记的推导并促进了对所研究生物医学问题的解释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号