Journal: Genome Biology

Sequence signatures extracted from proximal promoters can be used to predict distal enhancers

Abstract

Background: Gene expression is controlled by proximal promoters and distal regulatory elements such as enhancers. While the activity of some promoters can be invariant across tissues, enhancers tend to be highly tissue-specific.

Results: We compiled sets of tissue-specific promoters based on gene expression profiles of 79 human tissues and cell types. Putative transcription factor binding sites within each set of sequences were used to train a support vector machine classifier capable of distinguishing tissue-specific promoters from control sequences. We obtained reliable classifiers for 92% of the tissues, with an area under the receiver operating characteristic curve between 60% (for subthalamic nucleus promoters) and 98% (for heart promoters). We next used these classifiers to identify tissue-specific enhancers, scanning distal non-coding sequences in the loci of the 200 most highly and lowly expressed genes. Thirty percent of reliable classifiers produced consistent enhancer predictions, with significantly higher densities in the loci of the most highly expressed compared to lowly expressed genes. Liver enhancer predictions were assessed in vivo using the hydrodynamic tail vein injection assay. Fifty-eight percent of the predictions yielded significant enhancer activity in the mouse liver, whereas a control set of five sequences was completely negative.

Conclusions: We conclude that promoters of tissue-specific genes often contain unambiguous tissue-specific signatures that can be learned and used for the de novo prediction of enhancers.
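
The pipeline outlined in the abstract (binding-site features, a classifier scored by area under the ROC curve, then a scan of distal sequence) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The motif strings, weights, window size, and threshold are all hypothetical. A plain linear scoring function stands in for the trained support vector machine, and exact string matching stands in for real binding-site models.

```python
from typing import List, Sequence, Tuple

# Hypothetical motif strings standing in for transcription factor binding
# site models; the abstract does not list the motif library actually used.
MOTIFS: Tuple[str, ...] = ("TGACTCA", "CCAAT", "GATA")


def count_motifs(seq: str, motifs: Sequence[str] = MOTIFS) -> List[int]:
    """Count exact (possibly overlapping) occurrences of each motif in seq."""
    seq = seq.upper()
    counts = []
    for motif in motifs:
        k = len(motif)
        counts.append(sum(1 for i in range(len(seq) - k + 1) if seq[i:i + k] == motif))
    return counts


def linear_score(features: Sequence[float], weights: Sequence[float], bias: float = 0.0) -> float:
    """Linear decision value; an assumed stand-in for a trained SVM."""
    return sum(f * w for f, w in zip(features, weights)) + bias


def auc(pos_scores: Sequence[float], neg_scores: Sequence[float]) -> float:
    """Area under the ROC curve, computed as the probability that a random
    positive outscores a random negative (ties count as one half)."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


def scan(seq: str, weights: Sequence[float], window: int, step: int,
         threshold: float) -> List[Tuple[int, float]]:
    """Slide a fixed window along seq and keep (start, score) for every
    window whose linear score exceeds the threshold."""
    hits = []
    for start in range(0, len(seq) - window + 1, step):
        score = linear_score(count_motifs(seq[start:start + window]), weights)
        if score > threshold:
            hits.append((start, score))
    return hits
```

In this sketch, an AUC of 0.5 corresponds to chance and 1.0 to perfect separation, which is the scale on which the abstract reports 60% to 98%. Calling `scan` on a locus with weights learned from promoters mirrors the idea of reusing a promoter classifier to flag candidate enhancers, under the simplifications stated above.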