
Relative stability of DNA as a generic criterion for promoter prediction: whole genome annotation of microbial genomes with varying nucleotide base composition



Abstract

The rapid increase in genome sequence information has made it necessary to annotate the functional elements of genomes, particularly those in non-coding regions, in their genomic context. The promoter region is the key regulatory region that enables a gene to be transcribed or repressed, but it is difficult to determine experimentally. In silico identification of promoters is therefore crucial, both to guide experimental work and to pinpoint the key region that controls transcription initiation of a gene. In this analysis, we demonstrate that while promoter regions are in general less stable than their flanking regions, their average free energy varies with the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house tool, 'PromPredict'. When applied to predict promoter regions corresponding to the 1144 and 612 experimentally validated transcription start sites (TSSs) in E. coli (50.8% GC) and B. subtilis (43.5% GC), it achieved sensitivities of 99% and 95% and precision values of 58% and 60%, respectively. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC), a sensitivity of 100% and a precision of 49% were obtained.
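
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the PromPredict implementation. The dinucleotide energies are illustrative placeholders (roughly nearest-neighbor stacking values in kcal/mol). The function names, the `min_difference` parameter, and the use of the standard definitions of sensitivity and precision are all assumptions made for this example. The abstract does not state the actual thresholds or window sizes.

```python
# Illustrative stacking free energies (kcal/mol) per dinucleotide step.
# Placeholder values for this sketch, NOT the table used by PromPredict.
NN_ENERGY = {
    "AA": -1.00, "TT": -1.00, "AT": -0.88, "TA": -0.58,
    "CA": -1.45, "TG": -1.45, "GT": -1.44, "AC": -1.44,
    "CT": -1.28, "AG": -1.28, "GA": -1.30, "TC": -1.30,
    "CG": -2.17, "GC": -2.24, "GG": -1.84, "CC": -1.84,
}


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def mean_free_energy(seq: str) -> float:
    """Average free energy over all overlapping dinucleotide steps."""
    seq = seq.upper()
    steps = [seq[i:i + 2] for i in range(len(seq) - 1)]
    return sum(NN_ENERGY[step] for step in steps) / len(steps)


def is_less_stable(upstream: str, downstream: str, min_difference: float) -> bool:
    """True when the upstream window is less stable (less negative energy)
    than the downstream window by at least min_difference (hypothetical
    threshold; a GC-dependent value would be chosen per genome region)."""
    difference = mean_free_energy(upstream) - mean_free_energy(downstream)
    return difference >= min_difference


def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of known TSSs that received a prediction (standard definition)."""
    return true_pos / (true_pos + false_neg)


def precision(true_pos: int, false_pos: int) -> float:
    """Fraction of predictions that match a known TSS (standard definition)."""
    return true_pos / (true_pos + false_pos)
```

For example, `is_less_stable("TATAAT" * 3, "GCGCGC" * 3, 0.5)` compares an AT-rich window with a GC-rich one. In a GC-dependent scheme, `min_difference` would be looked up from the `gc_content` of the surrounding sequence.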

Bibliographic details

  • Source
    Molecular BioSystems | 2009, Issue 12 | pp. 1758-1769 | 12 pages
  • Author affiliations

    Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560 012, India;

    Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560 012, India;

  • Original format: PDF
  • Language: English
