首页> 美国卫生研究院文献>Bioinformatics >M are better than one: an ensemble-based motif finder and its application to regulatory element prediction
【2h】

M are better than one: an ensemble-based motif finder and its application to regulatory element prediction

机译:M胜过一个:基于整体的主题发现器及其在调控元素预测中的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Motivation: Identifying regulatory elements in genomic sequences is a key component in understanding the control of gene expression. Computationally, this problem is often addressed by motif discovery, where the goal is to find a set of mutually similar subsequences within a collection of input sequences. Though motif discovery is widely studied and many approaches to it have been suggested, it remains a challenging and as yet unresolved problem.>Results: We introduce SAMF (Solution-Aggregating Motif Finder), a novel approach for motif discovery. SAMF is based on a Markov Random Field formulation, and its key idea is to uncover and aggregate multiple statistically significant solutions to the given motif finding problem. In contrast to many earlier methods, SAMF does not require prior estimates on the number of motif instances present in the data, is not limited by motif length, and allows motifs to overlap. Though SAMF is broadly applicable, these features make it particularly well suited for addressing the challenges of prokaryotic regulatory element detection. We test SAMF's ability to find transcription factor binding sites in an Escherichia coli dataset and show that it outperforms previous methods. Additionally, we uncover a number of previously unidentified binding sites in this data, and provide evidence that they correspond to actual regulatory elements.>Contact: , ,>Supplementary information: are available at Bioinformatics online.
机译:>动机:识别基因组序列中的调控元件是理解基因表达控制的关键组成部分。从计算上讲,此问题通常通过基序发现解决,目标是在一组输入序列内找到一组相互相似的子序列。尽管动机发现的研究已经广泛研究,并且已经提出了许多解决方法,但它仍然是一个具有挑战性且尚未解决的问题。>结果:我们引入了SAMF(溶液聚合主题查找器),这是一种新颖的动机发现方法发现。 SAMF基于马尔可夫随机场公式,其主要思想是为给定的图案发现问题发现并聚合多个具有统计意义的解决方案。与许多早期方法相比,SAMF不需要对数据中存在的基序实例数量进行事先估算,不受基序长度的限制,并且可以使基元重叠。尽管SAMF广泛适用,但这些功能使其特别适合解决原核调节元件检测的挑战。我们测试了SAMF在大肠杆菌数据集中发现转录因子结合位点的能力,并证明它优于以前的方法。此外,我们在此数据中发现了许多以前未识别的结合位点,并提供了它们与实际调控元素相对应的证据。>联系方式: 、、、 >补充信息:在线生物信息学。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号