首页> 外文会议>IEEE International Conference on Systems Biology >A novel pipeline for motif discovery, pruning and validation in promoter sequences of human tissue specific genes
【24h】

A novel pipeline for motif discovery, pruning and validation in promoter sequences of human tissue specific genes

机译:一种新型管道,用于人组织特异性基因启动子序列的促进和验证

获取原文

摘要

Identification and analysis of tissue-specific (TS) genes and their regulatory activities play an important role in the understanding of mechanisms of organisms, disease diagnosis and drug design. In this paper, we designed a pipeline for the discovery of promoter motifs for tissue-specific genes. The pipeline consists of three phases: motif searching, motif merging and motif validation. The motif searching phase integrated three algorithms: MEME, AlignACE and Gibbs Sampling. In the second phase, we proposed a motif merging method, which is based on Bayesian probabilistic principles, to reduce redundancies of motifs from the first phase. Lastly, the motif validation phase verified the statistical significance of discovered motifs using a Bayesian Hypothesis Test approach. We performed the analysis on the sequences of promoter regions (−449bp–1000bp) of 4,552 human tissue-specific genes across 82 tissues and 924 housekeeping genes. The distributions of motifs in different promoter regions show that most motifs prefer to be in the proximal region (+500∼50bp, −50bp∼–500bp) of promoters.
机译:组织特异性(TS)基因的鉴定和分析及其监管活动在理解生物体,疾病诊断和药物设计机制方面发挥着重要作用。在本文中,我们设计了一种用于发现组织特异性基因的启动子图案的管道。管道由三个阶段组成:图案搜索,图案合并和主题验证。主题搜索阶段集成的三种算法:MEME,Secondace和Gibbs采样。在第二阶段,我们提出了一种基于贝叶斯概率原理的基序合并方法,以减少从第一阶段的主题的冗余。最后,主题验证阶段使用贝叶斯假设试验方法验证了发现的图案的统计学意义。我们对82个组织和924个内政基因进行了4,552个人组织特异性基因的启动子区(-449bp-1000bp)序列的分析。不同启动子区域中的基序的分布表明,大多数基序更喜欢在近端区域(+ 500〜50bp,-50bp〜-500bp)的启动子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号