首页> 美国卫生研究院文献>Nucleic Acids Research >Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences
【2h】

Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences

机译:核苷酸序列中位置特异性得分矩阵代表的基序簇的统计意义

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The human genome encodes the transcriptional control of its genes in clusters of cis-elements that constitute enhancers, silencers and promoter signals. The sequence motifs of individual cis- elements are usually too short and degenerate for confident detection. In most cases, the requirements for organization of cis-elements within these clusters are poorly understood. Therefore, we have developed a general method to detect local concentrations of cis-element motifs, using predetermined matrix representations of the cis-elements, and calculate the statistical significance of these motif clusters. The statistical significance calculation is highly accurate not only for idealized, pseudorandom DNA, but also for real human DNA. We use our method ‘cluster of motifs E-value tool’ (COMET) to make novel predictions concerning the regulation of genes by transcription factors associated with muscle. COMET performs comparably with two alternative state-of-the-art techniques, which are more complex and lack E-value calculations. Our statistical method enables us to clarify the major bottleneck in the hard problem of detecting cis-regulatory regions, which is that many known enhancers do not contain very significant clusters of the motif types that we search for. Thus, discovery of additional signals that belong to these regulatory regions will be the key to future progress.
机译:人类基因组在组成增强子,沉默子和启动子信号的顺式元件簇中编码其基因的转录控制。单个顺式元件的序列基序通常太短且简并不能可靠检测。在大多数情况下,对这些簇中顺式元素的组织要求了解得很少。因此,我们已经开发了一种常规方法,使用预定的顺式元素矩阵表示法来检测顺式元素基序的局部浓度,并计算这些基序簇的统计显着性。统计显着性计算不仅对于理想化的伪随机DNA,而且对于真实人类DNA都是高度准确的。我们使用“图案群E值工具”(COMET)方法对与肌肉相关的转录因子对基因的调控做出了新的预测。 COMET与两种替代的最先进的技术具有可比性,后者更复杂且缺乏E值计算。我们的统计方法使我们能够弄清检测顺式调节区这一难题的主要瓶颈,这是许多已知的增强子都不包含我们要搜索的非常重要的基序类型簇。因此,发现属于这些调控区域的其他信号将是未来发展的关键。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号