...
首页> 外文期刊>BMC Genomics >Revealing common disease mechanisms shared by tumors of different tissues of origin through semantic representation of genomic alterations and topic modeling
【24h】

Revealing common disease mechanisms shared by tumors of different tissues of origin through semantic representation of genomic alterations and topic modeling

机译:通过基因组改变的语义表示和主题建模揭示不同来源组织的肿瘤共有的常见疾病机制

获取原文
           

摘要

Background Cancer is a complex disease driven by somatic genomic alterations (SGAs) that perturb signaling pathways and consequently cellular function. Identifying patterns of pathway perturbations would provide insights into common disease mechanisms shared among tumors, which is important for guiding treatment and predicting outcome. However, identifying perturbed pathways is challenging, because different tumors can have the same perturbed pathways that are perturbed by different SGAs. Here, we designed novel semantic representations that capture the functional similarity of distinct SGAs perturbing a common pathway in different tumors. Combining this representation with topic modeling would allow us to identify patterns in altered signaling pathways. Results We represented each gene with a vector of words describing its function, and we represented the SGAs of a tumor as a text document by pooling the words representing individual SGAs. We applied the nested hierarchical Dirichlet process (nHDP) model to a collection of tumors of 5 cancer types from TCGA. We identified topics (consisting of co-occurring words) representing the common functional themes of different SGAs. Tumors were clustered based on their topic associations, such that each cluster consists of tumors sharing common functional themes. The resulting clusters contained mixtures of cancer types, which indicates that different cancer types can share disease mechanisms. Survival analysis based on the clusters revealed significant differences in survival among the tumors of the same cancer type that were assigned to different clusters. Conclusions The results indicate that applying topic modeling to semantic representations of tumors identifies patterns in the combinations of altered functional pathways in cancer.
机译:背景技术癌症是一种复杂的疾病,由体细胞基因组改变(SGA)驱动,会干扰信号传导途径并进而影响细胞功能。鉴定途径扰动的方式将提供对肿瘤之间共有的常见疾病机制的见解,这对于指导治疗和预测结果很重要。但是,确定扰动的路径是具有挑战性的,因为不同的肿瘤可能具有与不同的SGA扰动的相同的扰动路径。在这里,我们设计了新颖的语义表示形式,可以捕获在不同肿瘤中扰乱共同途径的不同SGA的功能相似性。将该表示形式与主题建模相结合,将使我们能够识别出变化的信号通路中的模式。结果我们用一个描述其功能的单词载体来代表每个基因,并通过合并代表单个SGA的单词来将肿瘤的SGAs作为文本文档来代表。我们将嵌套的分层Dirichlet过程(nHDP)模型应用于TCGA的5种癌症类型的肿瘤集合。我们确定了代表不同SGA共同功能主题的主题(由同时出现的单词组成)。根据肿瘤的主题关联对肿瘤进行聚类,从而使每个聚类由具有共同功能主题的肿瘤组成。产生的簇包含癌症类型的混合物,这表明不同的癌症类型可以共享疾病机制。基于聚类的生存分析显示,在分配给不同聚类的相同癌症类型的肿瘤之间,生存率存在显着差异。结论结果表明,将主题建模应用于肿瘤的语义表示可以识别癌症中功能途径改变的组合中的模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号