Comparative and functional genomics

A Web Service for Biomedical Term Look-Up

Abstract

Recent years have seen a huge increase in the amount of biomedical information that is available in electronic format. Consequently, for biomedical researchers wishing to relate their experimental results to relevant data lurking somewhere within this expanding universe of on-line information, the ability to access and navigate biomedical information sources in an efficient manner has become increasingly important. Natural language and text processing techniques can facilitate this task by making the information contained in textual resources such as MEDLINE more readily accessible and amenable to computational processing. Names of biological entities such as genes and proteins provide critical links between different biomedical information sources and researchers' experimental data. Therefore, automatic identification and classification of these terms in text is an essential capability of any natural language processing system aimed at managing the wealth of biomedical information that is available electronically. To support term recognition in the biomedical domain, we have developed Termino, a large-scale terminological resource for text processing applications, which has two main components: first, a database into which very large numbers of terms can be loaded from resources such as UMLS, and stored together with various kinds of relevant information; second, a finite state recognizer, for fast and efficient identification and mark-up of terms within text. Since many biomedical applications require this functionality, we have made Termino available to the community as a web service, which allows for its integration into larger applications as a remotely located component, accessed through a standardized interface over the web.
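The two components described in the abstract, a term store and a finite state recognizer that marks up terms in text, can be illustrated with a minimal sketch. This is not Termino's implementation or its web service interface, neither of which is given here. It assumes a small in-memory lexicon in place of the database, whitespace tokenization, case-insensitive matching, and a token-level trie acting as the finite state recognizer with leftmost-longest matching. All names (`Node`, `build_recognizer`, `recognize`, `mark_up`), the sample terms, and the `<term type="...">` tag format are hypothetical.

```python
from typing import Dict, List, Optional, Tuple


class Node:
    """One state of the recognizer; `category` is set on accepting states."""

    def __init__(self) -> None:
        self.children: Dict[str, "Node"] = {}
        self.category: Optional[str] = None


def build_recognizer(lexicon: Dict[str, str]) -> Node:
    """Compile a term -> category lexicon into a token-level trie."""
    root = Node()
    for term, category in lexicon.items():
        node = root
        for token in term.lower().split():
            node = node.children.setdefault(token, Node())
        node.category = category
    return root


def recognize(text: str, root: Node) -> List[Tuple[int, int, str]]:
    """Return (start, end, category) token spans, leftmost-longest first."""
    tokens = text.lower().split()
    spans = []
    i = 0
    while i < len(tokens):
        node, j, best = root, i, None
        # Follow transitions as far as possible, remembering the last
        # accepting state so the longest known term wins.
        while j < len(tokens) and tokens[j] in node.children:
            node = node.children[tokens[j]]
            j += 1
            if node.category is not None:
                best = (i, j, node.category)
        if best is not None:
            spans.append(best)
            i = best[1]  # resume after the matched term
        else:
            i += 1
    return spans


def mark_up(text: str, root: Node) -> str:
    """Wrap each recognized term in a (hypothetical) <term> tag."""
    tokens = text.split()
    out, pos = [], 0
    for start, end, category in recognize(text, root):
        out.extend(tokens[pos:start])
        term = " ".join(tokens[start:end])
        out.append(f'<term type="{category}">{term}</term>')
        pos = end
    out.extend(tokens[pos:])
    return " ".join(out)


# Illustrative lexicon; a real resource would load terms from e.g. UMLS.
lexicon = {
    "p53": "gene",
    "tumor": "disease",
    "tumor necrosis factor": "protein",
}
recognizer = build_recognizer(lexicon)
sentence = "Expression of tumor necrosis factor and p53 was measured"
print(mark_up(sentence, recognizer))
```

Because the recognizer keeps following transitions after reaching an accepting state, "tumor necrosis factor" is marked up as one protein term rather than as the shorter term "tumor". Matching cost grows with the length of the text rather than with the size of the lexicon, which is the usual reason for compiling a large term list into a finite state structure.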