...
首页> 外文期刊>PLoS Computational Biology >Biomarker Discovery by Sparse Canonical Correlation Analysis of Complex Clinical Phenotypes of Tuberculosis and Malaria
【24h】

Biomarker Discovery by Sparse Canonical Correlation Analysis of Complex Clinical Phenotypes of Tuberculosis and Malaria

机译:通过复杂的结核和疟疾临床表型的稀疏典范相关分析发现生物标志物

获取原文
           

摘要

Biomarker discovery aims to find small subsets of relevant variables in ‘omics data that correlate with the clinical syndromes of interest. Despite the fact that clinical phenotypes are usually characterized by a complex set of clinical parameters, current computational approaches assume univariate targets, e.g. diagnostic classes, against which associations are sought for. We propose an approach based on asymmetrical sparse canonical correlation analysis (SCCA) that finds multivariate correlations between the ‘omics measurements and the complex clinical phenotypes. We correlated plasma proteomics data to multivariate overlapping complex clinical phenotypes from tuberculosis and malaria datasets. We discovered relevant ‘omic biomarkers that have a high correlation to profiles of clinical measurements and are remarkably sparse, containing 1.5–3% of all ‘omic variables. We show that using clinical view projections we obtain remarkable improvements in diagnostic class prediction, up to 11% in tuberculosis and up to 5% in malaria. Our approach finds proteomic-biomarkers that correlate with complex combinations of clinical-biomarkers. Using the clinical-biomarkers improves the accuracy of diagnostic class prediction while not requiring the measurement plasma proteomic profiles of each subject. Our approach makes it feasible to use omics' data to build accurate diagnostic algorithms that can be deployed to community health centres lacking the expensive ‘omics measurement capabilities.
机译:生物标志物的发现旨在在组学数据中找到与相关临床症状相关的一小部分相关变量。尽管事实上临床表型通常以一组复杂的临床参数为特征,但当前的计算方法仍采用单变量目标,例如诊断类,寻求关联。我们提出了一种基于非对称稀疏典范相关分析(SCCA)的方法,该方法可在'组学测量值和复杂的临床表型之间找到多元相关性。我们将血浆蛋白质组学数据与结核病和疟疾数据集的多元重叠复杂临床表型相关联。我们发现了相关的“组学生物标记物”,其与临床测量结果具有高度相关性,并且稀疏,包含所有“组学变量”的1.5%至3%。我们显示,使用临床观点预测,我们在诊断类别预测方面获得了显着改善,其中结核病高达11%,疟疾高达5%。我们的方法发现与临床生物标记物的复杂组合相关的蛋白质组生物标记物。使用临床生物标记物可提高诊断类别预测的准确性,同时不需要测量每个受试者的血浆蛋白质组学特征。我们的方法使使用组学数据构建准确的诊断算法变得可行,该算法可部署到缺乏昂贵的组学测量功能的社区卫生中心。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号