Parser Combinators: a Practical Application for Generating Parsers for NMR Data

Abstract

Nuclear Magnetic Resonance (NMR) spectroscopy is a technique for acquiring protein data at atomic resolution and determining the three-dimensional structure of large protein molecules. A typical structure determination process results in the deposition of large data sets to the BMRB (Biological Magnetic Resonance Data Bank). These data are stored and shared in a file format called NMR-Star. The format is syntactically and semantically complex, which makes it challenging to parse. Nevertheless, parsing these files is crucial to applying the vast amounts of biological information stored in them, allowing researchers to harness the results of previous studies to direct and validate future work. One powerful approach to parsing is to apply a Backus-Naur Form (BNF) grammar, a high-level model of a file format; the grammatical model can then be translated automatically into an executable parser. This paper shows how we applied a model BNF grammar of the NMR-Star format to create a free, open-source parser using "parser combinators", a method that originated in the functional programming world. It demonstrates the effectiveness of a principled approach to file specification and parsing. It also builds upon our previous work [1] in that 1) it applies concepts from functional programming, which remain relevant even though the implementation language, Java, is more mainstream than functional languages, and 2) all work from this project will be made available under standard open-source licenses, giving the community the opportunity to learn from our techniques and methods.
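To make the idea of translating a BNF grammar into a parser with combinators more concrete, the following is a minimal Java sketch. It is not the parser described in this paper, and the grammar is a deliberately simplified, hypothetical fragment (datum ::= tag whitespace value; tag ::= "_" namechar+; value ::= namechar+) rather than the real NMR-Star grammar. All names (Result, Parser, Combinators, Grammar, Datum) are assumptions introduced only for illustration.

```java
import java.util.Optional;
import java.util.function.Function;
import java.util.function.Predicate;

/** A successful parse: the value produced plus the input left unconsumed. */
final class Result<T> {
    final T value;
    final String rest;
    Result(T value, String rest) { this.value = value; this.rest = rest; }
}

/** A parser maps an input string to a Result, or to Optional.empty() on failure. */
interface Parser<T> {
    Optional<Result<T>> parse(String input);

    /** Sequence: run this parser, then the parser chosen by f on the remaining input. */
    default <U> Parser<U> flatMap(Function<T, Parser<U>> f) {
        return in -> parse(in).flatMap(r -> f.apply(r.value).parse(r.rest));
    }

    /** Transform the parsed value without consuming more input. */
    default <U> Parser<U> map(Function<T, U> f) {
        return in -> parse(in).map(r -> new Result<U>(f.apply(r.value), r.rest));
    }

    /** Choice ("|" in BNF): try this parser; if it fails, try other on the same input. */
    default Parser<T> or(Parser<T> other) {
        return in -> {
            Optional<Result<T>> first = parse(in);
            return first.isPresent() ? first : other.parse(in);
        };
    }
}

final class Combinators {
    /** Matches exactly the character c. */
    static Parser<Character> ch(char c) {
        return in -> {
            if (!in.isEmpty() && in.charAt(0) == c) {
                return Optional.of(new Result<Character>(c, in.substring(1)));
            }
            return Optional.empty();
        };
    }

    /** One or more characters satisfying p ("x+" in BNF), returned as a string. */
    static Parser<String> many1(Predicate<Character> p) {
        return in -> {
            int i = 0;
            while (i < in.length() && p.test(in.charAt(i))) i++;
            if (i == 0) return Optional.empty();
            return Optional.of(new Result<String>(in.substring(0, i), in.substring(i)));
        };
    }
}

/** Hypothetical value type for one tag/value pair. */
final class Datum {
    final String tag;
    final String value;
    Datum(String tag, String value) { this.tag = tag; this.value = value; }
}

/** Each grammar rule becomes one parser, built from smaller parsers. */
final class Grammar {
    // whitespace ::= (space | tab | newline)+
    static final Parser<String> WHITESPACE =
        Combinators.many1(c -> Character.isWhitespace(c));

    // value ::= namechar+   (namechar = any non-whitespace character)
    static final Parser<String> VALUE =
        Combinators.many1(c -> !Character.isWhitespace(c));

    // tag ::= "_" namechar+
    static final Parser<String> TAG =
        Combinators.ch('_')
            .flatMap(underscore -> VALUE)
            .map(name -> "_" + name);

    // datum ::= tag whitespace value
    static final Parser<Datum> DATUM =
        TAG.flatMap(tag ->
            WHITESPACE.flatMap(ws ->
                VALUE.map(value -> new Datum(tag, value))));
}
```

Under these assumptions, Grammar.DATUM.parse("_Entry.ID 15000") succeeds with the tag "_Entry.ID" and the value "15000", while an input that does not start with an underscore fails. The point of the sketch is that each parser definition mirrors one grammar rule, so the code stays close to the BNF model it was derived from.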
