首页> 外文期刊>Current Bioinformatics >Statistical Analysis of TATA Box and Its Extensions in the Promoters of Human Genes
【24h】

Statistical Analysis of TATA Box and Its Extensions in the Promoters of Human Genes

机译:TATA盒及其在人类基因启动子中的扩展的统计分析

获取原文
获取原文并翻译 | 示例
           

摘要

We have conducted a dedicated analysis on the frequency distribution of the TATA Box and TATA extension sequences on six data sets of human promoters. Promoters in these sets have different lengths and are from different types of genes (housekeeping genes, tissue specific genes, and all genes). The statistical approach developed in this study will firstly partition the promoters into bins of 20 bp long, then calculate the frequency distribution of TATA elements and TATA extension sequences. The median value is used to capture outstanding TATA elements or TATA extension sequences when calculating their statistical significance. This study discovered that two of the 16 TATA Box elements (TATAAAAG and TATATAAG) showed the sharpest peaks at the location of 10∼30 bp upstream from transcription start sites where TATA Box is believed to reside. Fourteen TATA Box extensions showed the sharpest peaks at this location as well among all TATA extension sequences. Two of these fourteen TATA extension sequences have been verified to be the transcription factor binding sites by other research efforts. We suggest that the remaining twelve TATA extension sequences are the new putative TATA binding sites. This study also found that there was very little difference between the frequency distribution of TATA elements on housekeeping genes and their frequency distribution on tissue specific genes.
机译:我们对人类启动子的六个数据集上的TATA Box和TATA延伸序列的频率分布进行了专门的分析。这些集合中的启动子具有不同的长度,并且来自不同类型的基因(管家基因,组织特异性基因和所有基因)。本研究开发的统计方法将首先将启动子划分为20 bp长的条带,然后计算TATA元件和TATA延伸序列的频率分布。在计算它们的统计显着性时,中值用于捕获出色的TATA元素或TATA扩展序列。这项研究发现,在TATA Box的16个转录元件中,有两个(TATAAAAG和TATATAAG)在转录起始位点上游10〜30 bp处显示了最尖锐的峰。在所有TATA扩展序列中,十四个TATA Box扩展在此位置也显示了最尖锐的峰。这十四个TATA延伸序列中的两个已被其他研究工作证实是转录因子结合位点。我们建议其余十二个TATA延伸序列是新的假定TATA结合位点。这项研究还发现,持家基因上的TATA元件的频率分布与组织特定基因上的TATA元件的频率分布之间几乎没有差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号