Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets

Abstract

In recent years, a growing number of researchers have focused on establishing associations between clinical and genomic data. To date, however, there is a lack of research that mines clinic-genomic associations by comprehensively analysing the available gene expression data for a single disease. Colorectal cancer is a malignant tumour, and a number of genetic syndromes have been proven to be associated with it. This paper presents our research on mining clinic-genomic associations for colorectal cancer in a biomedical big data environment. The proposed method combines multiple technologies: extracting clinical concepts using the Unified Medical Language System (UMLS), extracting genes through literature mining, and mining clinic-genomic associations through statistical analysis. We applied this method to datasets extracted from both the Gene Expression Omnibus (GEO) and the Genetic Association Database (GAD). A total of 23517 clinic-genomic associations between 139 clinical concepts and 7914 genes were obtained, of which 3474 associations between 31 clinical concepts and 1689 genes were identified as highly reliable. Evaluation and interpretation were performed using UMLS, KEGG, and Gephi, and potential new discoveries were explored. The proposed method is effective in mining valuable knowledge from available biomedical big data and performs well in bridging clinical data with genomic data for colorectal cancer.