首页> 外文会议>International Conference on BioMedical Engineering and Informatics >Quantitative measurement of clinic-genomic association for colorectal cancer using literature mining and Google-distance algorithm
【24h】

Quantitative measurement of clinic-genomic association for colorectal cancer using literature mining and Google-distance algorithm

机译:用文献挖掘和谷歌距离算法定量测量结肠直肠癌结肠癌癌症

获取原文

摘要

Nowadays, a growing number of researchers devote themselves to re-excavation of existing biomedical knowledge discovery, focusing on how to establish associations between clinical and genomic data. However, quantitative analysis is still inadequate for a particular disease. Colorectal cancer is the one of malignant tumors whose molecular mechanism is relatively clear, making it a more appropriate object of study. This paper proposed a quantitative measurement of clinic-genomic associations for colorectal cancer based on Google Distance, using MEDLINE database as the corpus. Our method is engineered with several technologies, including mapping clinic and genomic data to MeSH terms, modifying Normalized Google Distance using year average. Data from Electronic Medical Records (EMR), Online Mendelian Inheritance in Man (OMIM), and Genetic Association Database (GAD) were used in this study. A total of 3795 clinic-genomic associations of colorectal cancer between 67 clinical concepts and 236 genes were obtained, of which 584 associations were identified for their gene is contained in the colorectal cancer pathway using KEGG pathway analysis. Assessment and interpretation were conducted using KEGG, GeneCards, and then getting new discoveries. This method is valid in quantitative analysis using biomedical literature and achieves a good performance in measuring the clinical data and genomic data, which can be transplanted to other disease research.
机译:如今,越来越多的研究人员致力于重新开挖现有的生物医学知识发现,重点是如何建立临床和基因组数据之间的关联。然而,定量分析对于特定疾病仍然不足。结肠直肠癌是恶性肿瘤中的一种,其分子机制相对清晰,使其成为更合适的研究对象。本文提出了基于Google距离的基于Google距离的结肠癌癌症的临床基因组关联的定量测量,使用Medline Database作为语料库。我们的方法具有多种技术,包括将诊所和基因组数据映射到网格术语,使用年平均修改标准化的Google距离。在本研究中使用了来自电子医疗记录(EMR)的数据,在线孟德尔遗传,以及遗传关联数据库(GAD)。在67个临床概念和236个基因之间共结直肠癌的3795个临床基因组缔结直肠癌缔结直肠癌,其中鉴定了584个关联,其基因包含在结直肠癌途径中,使用KEGG途径分析。使用KEGG,GENECARDS进行评估和解释,然后获得新发现。该方法在使用生物医学文献的定量分析中是有效的,并且在测量临床数据和基因组数据方面取得了良好的性能,这可以移植到其他疾病研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号