首页> 美国卫生研究院文献>BMC Bioinformatics >gnparser: a powerful parser for scientific names based on Parsing Expression Grammar
【2h】

gnparser: a powerful parser for scientific names based on Parsing Expression Grammar

机译:gnparser:基于解析表达式语法的功能强大的科学名称解析器

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundScientific names in biology act as universal links. They allow us to cross-reference information about organisms globally. However variations in spelling of scientific names greatly diminish their ability to interconnect data. Such variations may include abbreviations, annotations, misspellings, etc. Authorship is a part of a scientific name and may also differ significantly. To match all possible variations of a name we need to divide them into their elements and classify each element according to its role. We refer to this as ‘parsing’ the name. Parsing categorizes name’s elements into those that are stable and those that are prone to change. Names are matched first by combining them according to their stable elements. Matches are then refined by examining their varying elements. This two stage process dramatically improves the number and quality of matches. It is especially useful for the automatic data exchange within the context of “Big Data” in biology.
机译:背景技术生物学中的科学名称是通用的链接。它们使我们能够在全球交叉引用有关生物的信息。但是,科学名称拼写的变化大大削弱了它们互连数据的能力。此类变化可能包括缩写,注释,拼写错误等。作者身份是科学名称的一部分,并且也可能有显着差异。为了匹配名称的所有可能变体,我们需要将其分为元素,并根据其作用对每个元素进行分类。我们称其为“解析”名称。解析将名称的元素归类为稳定的元素和易于更改的元素。首先通过根据名称的稳定元素将名称进行组合来匹配名称。然后通过检查匹配项的不同元素来完善匹配项。这两个阶段的过程大大提高了比赛的数量和质量。对于生物学中“大数据”环境中的自动数据交换而言,它特别有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号