...
首页> 外文期刊>Genome research >Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression.
【24h】

Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression.

机译:全基因组的转录调控模块的计算预测揭示了人类基因表达的新见解。

获取原文
获取原文并翻译 | 示例
           

摘要

The identification of regulatory regions is one of the most important and challenging problems toward the functional annotation of the human genome. In higher eukaryotes, transcription-factor (TF) binding sites are often organized in clusters called cis-regulatory modules (CRM). While the prediction of individual TF-binding sites is a notoriously difficult problem, CRM prediction has proven to be somewhat more reliable. Starting from a set of predicted binding sites for more than 200 TF families documented in Transfac, we describe an algorithm relying on the principle that CRMs generally contain several phylogenetically conserved binding sites for a few different TFs. The method allows the prediction of more than 118,000 CRMs within the human genome. A subset of these is shown to be bound in vivo by TFs using ChIP-chip. Their analysis reveals, among other things, that CRM density varies widely across the genome, with CRM-rich regions often being located near genes encoding transcription factors involved in development. Predicted CRMs show a surprising enrichment near the 3' end of genes and in regions far from genes. We document the tendency for certain TFs to bind modules located in specific regions with respect to their target genes and identify TFs likely to be involved in tissue-specific regulation. The set of predicted CRMs, which is made available as a public database called PReMod (http://genomequebec.mcgill.ca/PReMod), will help analyze regulatory mechanisms in specific biological systems.
机译:对于人类基因组的功能注释,调节区域的识别是最重要和最具挑战性的问题之一。在高级真核生物中,转录因子(TF)结合位点通常以称为顺式调节模块(CRM)的簇的形式组织。尽管单个TF结合位点的预测是一个众所周知的难题,但CRM预测已被证明更加可靠。从Transfac中记录的200多个TF家族的预测结合位点开始,我们描述了一种算法,该算法基于以下原则:CRM通常包含几个不同TF的几个系统发育上保守的结合位点。该方法可以预测人类基因组中超过118,000个CRM。使用ChIP芯片,这些子集显示与TF结合。他们的分析显示,除其他外,CRM密度在整个基因组中差异很大,其中CRM丰富的区域通常位于编码参与发育的转录因子的基因附近。预测的CRM会在基因的3'端附近和远离基因的区域显示出令人惊讶的富集。我们记录了某些TF结合其目标基因相对于位于特定区域的模块的趋势,并确定了可能参与组织特异性调控的TF。这套预测CRM可以作为称为PReMod(http://genomequebec.mcgill.ca/PReMod)的公共数据库提供,将有助于分析特定生物系统中的调节机制。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号