首页> 外文会议>International Conference on Communication and Electronics Systems >Bi-directional Methodology for Literature Extraction from PubMed Abstracts using Web Scrapper and Web Crawler
【24h】

Bi-directional Methodology for Literature Extraction from PubMed Abstracts using Web Scrapper and Web Crawler

机译:使用Web抓取器和Web搜寻器从PubMed摘要中提取文献的双向方法

获取原文

摘要

Searching of literaure is the prime processes in any research to gain insights on the progress carried out on specific problems. Literature search in biological science and bioinforamtics is more complex as the researchers has to find the relrevant articles specified to the biological components like DNA, RNA, Gene, PROTEIN. Pubmed is a literature database which cosists of millions of articles submitted by researcher based on the experimerntal studies. Analysing Gene for the study is important for studies and experiments related to diseases. In this paper a bi-directional methdology is propsoed to extract literarure from pubmed abstracts using web scrapper and web crawler by providing gene names and map gene with Gene card Database. In the second approach a methdology is provided to analyse Gene expression profiles from GEO database using EDA and Map the significant gene reterived to the Pubmed and Gene Card database. The differentially expressed genes are grouped after performing PCA and clustered using k-means. Lung cancer is major cancer reported in WHO and hence in this study the Lung cancer is used as the usecase to test the methdology proposed. The experiemnts are implemented using Python. From the experimental results it is found that the propsoed approach reduce the time for searching the literature.
机译:文献搜索是任何研究中的素质流程,以获得关于对特定问题进行的进展的见解。在生物科学和生物侵害中的文献搜索更复杂,因为研究人员必须找到为DNA,RNA,基因,蛋白质等生物组分指定的重新遗传物品。 PubMed是一个文学数据库,由研究人员基于实验研究的研究人员提交的数百万文章。分析研究基因对于与疾病有关的研究和实验是重要的。在本文中,通过提供基因名称和地图基因与基因卡数据库,将双向甲状甲基学被预先从PubMed摘要提取文学,以通过使用基因名称和映射基因来提取来自PubMed摘要。在第二种方法中,提供了使用EDA从Geo数据库分析基因表达谱的甲状物质,并将重新定位的重要基因映射到PubMed和基因卡数据库。差异表达的基因在进行PCA之后进行分组并使用K-Means聚类。肺癌是世卫组织的主要癌症,因此在这项研究中,肺癌用作usecase以测试提出的甲状甲。使用Python实现经验。从实验结果来看,发现预探测方法减少了搜索文献的时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号