...
首页> 外文期刊>Nucleic Acids Research >A graph-based motif detection algorithm models complex nucleotide dependencies in transcription factor binding sites
【24h】

A graph-based motif detection algorithm models complex nucleotide dependencies in transcription factor binding sites

机译:基于图的基序检测算法模拟转录因子结合位点中的复杂核苷酸依赖性

获取原文
获取原文并翻译 | 示例
           

摘要

Given a set of known binding sites for a specific transcription factor, it is possible to build a model of the transcription factor binding site, usually called a motif model, and use this model to search for other sites that bind the same transcription factor. Typically, this search is performed using a position-specific scoring matrix (PSSM), also known as a position weight matrix. In this paper we analyze a set of eukaryotic transcription factor binding sites and show that there is extensive clustering of similar k-mers in eukaryotic motifs, owing to both functional and evolutionary constraints. The apparent limitations of probabilistic models in representing complex nucleotide dependencies lead us to a graph-based representation of motifs. Whendeciding whether a candidate k-mer is part of a motif or not, we base our decision not on how well the k-mer conforms to a model of the motif as a whole, but how similar it is to specific, known k-mers in the motif. We elucidate the reasons why we expect graph-based methods to perform well on motif data. Our MotifScan algorithm shows greatly improved performance over the prevalent PSSM-based method for the detection of eukaryotic motifs.
机译:给定一组特定转录因子的已知结合位点,可以建立通常称为基序模型的转录因子结合位点的模型,并使用该模型搜索结合相同转录因子的其他位点。通常,使用位置特定评分矩阵(PSSM)(也称为位置权重矩阵)执行此搜索。在本文中,我们分析了一组真核转录因子结合位点,并显示由于功能和进化方面的限制,在真核基序中存在大量相似的k-mer聚类。概率模型在表示复杂核苷酸依赖性方面的明显局限性导致我们找到基于图形的基序表示。在确定候选k-mer是否为基序的一部分时,我们的决定不是基于k-mer与整个基序模型的一致性如何,而是与特定的已知k-mer有多相似在主题。我们阐明了为什么我们期望基于图的方法在主题数据上表现良好的原因。我们的MotifScan算法显示出比基于PSSM的流行方法更高的检测真核基序的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号