首页> 外文期刊>Journal of biosciences >pubmed.mineR: An R package with text-mining algorithms to analyse PubMed abstracts
【24h】

pubmed.mineR: An R package with text-mining algorithms to analyse PubMed abstracts

机译:pubmed.mineR:具有文本挖掘算法的R包,用于分析PubMed摘要

获取原文
           

摘要

The PubMed literature database is a valuable source of information for scientific research. It is rich in biomedical literature with more than 24 million citations. Data-mining of voluminous literature is a challenging task. Although several text-mining algorithms have been developed in recent years with focus on data visualization, they have limitations such as speed, are rigid and are not available in the open source. We have developed an R package, pubmed.mineR, wherein we have combined the advantages of existing algorithms, overcome their limitations, and offer user flexibility and link with other packages in Bioconductor and the Comprehensive R Network (CRAN) in order to expand the user capabilities for executing multifaceted approaches. Three case studies are presented, namely, `Evolving role of diabetes educators', `Cancer risk assessment' and `Dynamic concepts on disease and comorbidity' to illustrate the use of pubmed.mineR. The package generally runs fast with small elapsed times in regular workstations even on large corpus sizes and with compute intensive functions. The pubmed.mineR is available at http://cran.r- project.org/web/packages/pubmed.mineR.
机译:PubMed文献数据库是科学研究的宝贵信息来源。它拥有丰富的生物医学文献,被引用超过2400万。大量文献的数据挖掘是一项艰巨的任务。尽管近年来开发了几种专注于数据可视化的文本挖掘算法,但它们具有速度,刚性等缺点,并且在开源中不可用。我们开发了一个R包pubmed.mineR,其中我们结合了现有算法的优势,克服了它们的局限性,并为用户提供了灵活性,并与Bioconductor和综合R网络(CRAN)中的其他包链接,以扩展用户执行多方面方法的能力。介绍了三个案例研究,分别是“糖尿病教育者的角色演变”,“癌症风险评估”和“疾病和合并症的动态概念”,以说明pubmed.mineR的使用。该软件包通常在常规工作站上运行时间短,运行速度快,即使在大型语料库中也具有计算密集型功能。 pubmed.mineR可从http://cran.r- project.org/web/packages/pubmed.mineR获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号