首页> 外文会议> >Build a Dictionary, Learn a Grammar, Decipher Stegoscripts, and Discover Genomic Regulatory Elements
【24h】

Build a Dictionary, Learn a Grammar, Decipher Stegoscripts, and Discover Genomic Regulatory Elements

机译:建立字典,学习语法,解密隐写文字和发现基因组调控元件

获取原文
获取原文并翻译 | 示例

摘要

It has been a challenge to discover transcription factor (TF) binding motifs (TFBMs), which are short cis-regulatory DNA sequences playing essential roles in transcriptional regulation. We approach the problem of discovering TFBMs from a steganographic perspective. We view the regulatory regions of a genome as if they constituted a stegoscript with conserved words (I.e., TFBMs) being embedded in a covertext, and model the stegoscript with a statistical model consisting of a dictionary and a grammar. We develop an efficient algorithm, WordSpy, to learn such a model from a stegoscript and to recover conserved motifs. Subsequently, we select biologically meaningful motifs based on a motif's specificity to the set of genes of interest and/or the expression coherence of the genes whose promoters contain the motif. From the promoters of 645 distinct cell-cycle related genes of 5. Cerevisiae, our method is able to identify all known cell-cycle related TFBMs among its top ranking motifs. Our method can also be directly applied to discriminative motif finding. By utilizing the ChIP-chip data of Lee et al, we predicted potential binding motifs of 113 known transcription factors of budding yeast.
机译:发现转录因子(TF)结合基序(TFBM)是一个挑战,这是在转录调控中起重要作用的短顺式调控DNA序列。我们从隐写的角度处理发现TFBM的问题。我们将基因组的调控区看作是构成隐含保守词(即TFBM)的隐写文字的标本,并使用由字典和语法组成的统计模型对隐写文字进行建模。我们开发了一种有效的算法WordSpy,以从隐藏文字中学习这种模型并恢复保守的图案。随后,我们根据基序对目标基因组的特异性和/或其启动子包含基序的基因的表达一致性选择生物学上有意义的基序。从5.酿酒酵母的645个不同的细胞周期相关基因的启动子中,我们的方法能够在其最上位的基序中识别所有已知的与细胞周期相关的TFBM。我们的方法也可以直接用于判别主题。通过利用Lee等人的ChIP芯片数据,我们预测了113个已知的萌芽酵母转录因子的潜在结合基序。

著录项

  • 来源
    《》|2005年|80-94|共15页
  • 会议地点 San DiegoCA(US)
  • 作者

    Guandong Wang; Weixiong Zhang;

  • 作者单位

    Department of Computer Science and Engineering, Washington University in Saint Louis Saint Louis, MO 63130-4899, USA;

    Department of Genetics Washington University in Saint Louis Saint Louis, MO 63130-4899, USA;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物工程学(生物技术);
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号