Nucleic Acids Research

PACRAT: a database and analysis system for archaeal and bacterial intergenic sequence features

Abstract

Analysis of intergenic sequences, for purposes such as investigating transcriptional signals or identifying small RNA genes, is frequently complicated by traditional biological database structures. Genome data is commonly treated as chromosome-length sequence records, annotated by gene calls that demarcate subsequences of the chromosomes. Under this model, determining the non-called subsequences between any gene and its nearest neighbors requires an exhaustive search of all gene calls associated with the chromosome. Compounding the issue, the location of intergenic regions for many called genes cannot be resolved unambiguously because of uncertainties in gene boundaries and the presence of other, conflicting gene calls. To address these difficulties we have constructed the PACRAT database system. PACRAT preprocesses GenBank genome submissions, evaluates for every gene the character of its relationship to the genes nearest to it, and produces a relationally linked model of the gene ordering for the genome. Using this information, the interface allows the researcher to query gene data as well as intergenic sequence data on a number of criteria. These include filtering searches by whether start and stop positions, or upstream/downstream sequences, conflict with called genes, and automatically extending upstream or downstream searches to find probable operon promoters or terminators. The database is also indexed by KEGG classification, allowing, for example, functionally related sets of high-quality promoter-containing regions to be retrieved as a group.
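The preprocessing idea described in the abstract can be illustrated with a minimal sketch. It is not PACRAT's actual schema or code: the names `GeneCall`, `Region`, and `intergenic_regions` are hypothetical, coordinates are assumed to be 1-based and inclusive, and the chromosome is assumed to be linear (circular wrap-around is ignored). The sketch sorts the gene calls once, then records for each gene the region separating it from the furthest-reaching call before it, flagging the pair as conflicting when the calls overlap.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GeneCall:
    """Hypothetical gene call; coordinates are 1-based and inclusive."""
    name: str
    start: int
    end: int
    strand: str  # '+' or '-'


@dataclass
class Region:
    """Region between a gene and the furthest-reaching call before it."""
    left: str
    right: str
    start: int
    end: int
    conflict: bool  # True when the two calls overlap

    @property
    def length(self) -> int:
        # Overlapping or abutting calls leave no intergenic sequence.
        return max(0, self.end - self.start + 1)


def intergenic_regions(genes: List[GeneCall]) -> List[Region]:
    """Sort the calls once, then link each gene to its preceding neighbor."""
    ordered = sorted(genes, key=lambda g: (g.start, g.end))
    regions: List[Region] = []
    if not ordered:
        return regions
    furthest = ordered[0]  # call with the greatest end coordinate seen so far
    for gene in ordered[1:]:
        regions.append(
            Region(
                left=furthest.name,
                right=gene.name,
                start=furthest.end + 1,
                end=gene.start - 1,
                conflict=gene.start <= furthest.end,
            )
        )
        if gene.end > furthest.end:
            furthest = gene
    return regions
```

Once the ordering is stored this way, a query for the sequence upstream of a gene becomes a lookup of one precomputed record rather than a scan of every gene call on the chromosome, and the `conflict` flag gives a simple way to exclude regions whose boundaries are ambiguous.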
