首页> 美国卫生研究院文献>Bioinformatics >Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery
【2h】

Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery

机译:通过规范化的无监督多核学习来集成不同的数据类型并将其应用于癌症亚型的发现

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Motivation: Despite ongoing cancer research, available therapies are still limited in quantity and effectiveness, and making treatment decisions for individual patients remains a hard problem. Established subtypes, which help guide these decisions, are mainly based on individual data types. However, the analysis of multidimensional patient data involving the measurements of various molecular features could reveal intrinsic characteristics of the tumor. Large-scale projects accumulate this kind of data for various cancer types, but we still lack the computational methods to reliably integrate this information in a meaningful manner. Therefore, we apply and extend current multiple kernel learning for dimensionality reduction approaches. On the one hand, we add a regularization term to avoid overfitting during the optimization procedure, and on the other hand, we show that one can even use several kernels per data type and thereby alleviate the user from having to choose the best kernel functions and kernel parameters for each data type beforehand.>Results: We have identified biologically meaningful subgroups for five different cancer types. Survival analysis has revealed significant differences between the survival times of the identified subtypes, with P values comparable or even better than state-of-the-art methods. Moreover, our resulting subtypes reflect combined patterns from the different data sources, and we demonstrate that input kernel matrices with only little information have less impact on the integrated kernel matrix. Our subtypes show different responses to specific therapies, which could eventually assist in treatment decision making.>Availability and implementation: An executable is available upon request.>Contact: or
机译:>动机:尽管正在进行癌症研究,但可用的疗法在数量和有效性上仍然有限,并且为个别患者制定治疗决策仍然是一个难题。可以帮助指导这些决策的既定子类型主要基于单个数据类型。然而,涉及各种分子特征测量的多维患者数据分析可以揭示肿瘤的内在特征。大型项目会收集各种癌症类型的此类数据,但我们仍然缺乏以有意义的方式可靠地集成此信息的计算方法。因此,我们将当前的多核学习应用并扩展为降维方法。一方面,我们添加了一个正则化项,以避免在优化过程中过度拟合;另一方面,我们表明,每个数据类型甚至可以使用多个内核,从而减轻了用户选择最佳内核功能的负担,并且>结果:我们已经为五种不同的癌症类型确定了具有生物学意义的亚组。生存分析表明,已鉴定亚型的生存时间之间存在显着差异,P值可与最新方法相比甚至更好。此外,我们得到的子类型反映了来自不同数据源的组合模式,并且我们证明了只有很少信息的输入内核矩阵对集成内核矩阵的影响较小。我们的子类型显示出对特定疗法的不同反应,最终可能有助于做出治疗决策。>可用性和实现:可根据要求提供可执行文件。>联系方式:

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号