首页> 美国卫生研究院文献>Nucleic Acids Research >Olfactory Receptor Database: a metadata-driven automated population from sources of gene and protein sequences
【2h】

Olfactory Receptor Database: a metadata-driven automated population from sources of gene and protein sequences

机译:嗅觉受体数据库:来自基因和蛋白质序列的元数据驱动的自动种群

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The Olfactory Receptor Database (ORDB; http://senselab.med.yale.edu/senselab/ordb) is a central repository of olfactory receptor (OR) and olfactory receptor-like gene and protein sequences. To deal with the very large OR gene family, we have constructed an algorithm that automatically downloads sequences from web sources such as GenBank and SWISS-PROT into the database. The algorithm uses hypertext markup language (HTML) parsing techniques that extract information relevant to ORDB. The information is then correlated with the metadata in the ORDB knowledge base to encode the unstructured text extracted into the structured format compliant with the database architecture, entity attribute value with classes and relationship (EAV/CR), which supports the SenseLab project as a whole. Three population methods: batch, automatic and semi-automatic population are discussed. The data is imported into the database using extensible markup language (XML).
机译:嗅觉受体数据库(ORDB; http://senselab.med.yale.edu/senselab/ordb)是嗅觉受体(OR)以及类似嗅觉受体的基因和蛋白质序列的中央存储库。为了处理非常大的OR基因家族,我们构建了一种算法,该算法可自动从Web来源(如GenBank和SWISS-PROT)将序列下载到数据库中。该算法使用超文本标记语言(HTML)解析技术来提取与ORDB相关的信息。然后将信息与ORDB知识库中的元数据相关联,以对提取的非结构化文本进行编码,使其成为符合数据库体系结构,具有类和关系的实体属性值(EAV / CR)的结构化格式,从而整体上支持SenseLab项目。讨论了三种填充方法:批量填充,自动填充和半自动填充。使用可扩展标记语言(XML)将数据导入数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号