Journal: BMC Bioinformatics

An open source infrastructure for managing knowledge and finding potential collaborators in a domain-specific subset of PubMed, with an example from human genome epidemiology



Abstract

Background: Identifying relevant research in an ever-growing body of published literature is becoming increasingly difficult. Establishing domain-specific knowledge bases may be a more effective and efficient way to manage and query information within specific biomedical fields. Adopting a controlled vocabulary is a critical step toward data integration and interoperability in any information system. We present an open source infrastructure that provides a powerful capacity for managing and mining data within a domain-specific knowledge base. As a practical application of our infrastructure, we present two applications, Literature Finder and Investigator Browser, as well as a tool set for automating the data curation process for the human genome published literature database. The design of this infrastructure makes the system potentially extensible to other data sources.

Results: Information retrieval and usability tests demonstrated that the system had high rates of recall and precision, 90% and 93% respectively. The system was easy to learn, easy to use, reasonably speedy and effective.

Conclusion: The open source system infrastructure presented in this paper provides a novel approach to managing and querying information and knowledge from domain-specific PubMed data. Using the controlled vocabulary UMLS enhanced data integration, interoperability and the extensibility of the system. In addition, by using an MVC-based design and Java as a platform-independent programming language, this system provides a potential infrastructure for any domain-specific knowledge base in the biomedical field.
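For readers unfamiliar with the two retrieval metrics reported in the Results, the following is a minimal sketch of how recall and precision are conventionally computed from a set of retrieved records and a gold-standard set of relevant records. It is not the authors' evaluation code; the function name `precision_recall` and the record identifiers are hypothetical and chosen only for illustration.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for retrieved IDs against a gold-standard set.

    precision = relevant records retrieved / all records retrieved
    recall    = relevant records retrieved / all relevant records
    """
    retrieved, relevant = set(retrieved), set(relevant)
    true_positives = len(retrieved & relevant)
    precision = true_positives / len(retrieved) if retrieved else 0.0
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical record identifiers, invented for illustration only.
relevant = {"a", "b", "c", "d"}   # records a curator judged relevant
retrieved = {"a", "b", "c", "x"}  # records a search returned

precision, recall = precision_recall(retrieved, relevant)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Under this definition, the reported figures would mean that about 90% of the relevant records were found (recall) and about 93% of the returned records were relevant (precision).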
