首页> 外文会议>International Conference on Intelligent Systems for Molecular Biology >Regulatory Element Detection using a Probabilistic Segmentation Model
【24h】

Regulatory Element Detection using a Probabilistic Segmentation Model

机译:使用概率分割模型检测调控元件检测

获取原文

摘要

The availability of genome-wide mRNA expression data for organisms whose genome is fully sequenced provides a unique data set from which to decipher how transcription is regulated by the upstream control region of a gene. A new algorithm is presented which decomposes DNA sequence into the most probable "dictionary" of motifs or words. Identification of words is based on a probabilistic segmentation model in which the significance of longer words is deduced from the frequency of shorter words of various length. This eliminates the need for a separate set of reference data to define probabilities, and genome-wide applications are therefore possible. For the 6000 upstream regulatory regions in the yeast genome, the 500 strongest motifs from a dictionary of size 1200 match at a significance level of 15 standard deviations to a database of cis-regulatory elements. Analysis of sets of genes such as those up-regulated during sporulation reveals many new putative regulatory sites in addition to identifying previously known sites.
机译:基因组的基因组的基因组mRNA表达数据的可用性提供了基因组的基因组提供了一种独特的数据集,从该数据集被解码是如何由基因的上游控制区域调节的。提出了一种新的算法,将DNA序列分解为主题或单词最可能的“字典”。单词的识别基于概率分割模型,其中从各种长度的较短单词的频率推导出更长的单词的重要性。这消除了对单独的参考数据进行单独的参考数据以定义概率,因此可以实现基因组应用。对于酵母基因组中的6000个上游调节区,来自大小1200字典的500个最强的图案,其具有与CIS-Charmatory元素数据库的15标准偏差的显着性水平。除了鉴定先前已知的位点之外,诸如孢子过程中上调的基因组的基因组的分析揭示了许多新推定的调节位点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号