首页> 外文期刊>Bioinformatics >M are better than one: an ensemble-based motif finder and its application to regulatory element prediction
【24h】

M are better than one: an ensemble-based motif finder and its application to regulatory element prediction

机译:M胜过一个:基于整体的基元查找器及其在调控元素预测中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: Identifying regulatory elements in genomic sequences is a key component in understanding the control of gene expression. Computationally, this problem is often addressed by motif discovery, where the goal is to find a set of mutually similar subsequences within a collection of input sequences. Though motif discovery is widely studied and many approaches to it have been suggested, it remains a challenging and as yet unresolved problem.Results: We introduce SAMF (Solution-Aggregating Motif Finder), a novel approach for motif discovery. SAMF is based on a Markov Random Field formulation, and its key idea is to uncover and aggregate multiple statistically significant solutions to the given motif finding problem. In contrast to many earlier methods, SAMF does not require prior estimates on the number of motif instances present in the data, is not limited by motif length, and allows motifs to overlap. Though SAMF is broadly applicable, these features make it particularly well suited for addressing the challenges of prokaryotic regulatory element detection. We test SAMFs ability to find transcription factor binding sites in an Escherichia coli dataset and show that it outperforms previous methods. Additionally, we uncover a number of previously unidentified binding sites in this data, and provide evidence that they correspond to actual regulatory elements.
机译:动机:鉴定基因组序列中的调控元件是理解基因表达控制的关键组成部分。从计算上来说,这个问题通常是通过基序发现来解决的,其目的是在输入序列的集合中找到一组相互相似的子序列。尽管对基序发现进行了广泛的研究并提出了许多解决方案,但它仍然是一个具有挑战性且尚未解决的问题。 SAMF基于马尔可夫随机场公式,其关键思想是发现和聚合给定主题发现问题的多个具有统计意义的解决方案。与许多早期方法相比,SAMF不需要对数据中存在的基序实例数量进行事先估算,不受基序长度的限制,并且可以使基元重叠。尽管SAMF广泛适用,但这些功能使其特别适合解决原核调节元件检测的挑战。我们测试了SAMFs在大肠杆菌数据集中查找转录因子结合位点的能力,并证明它优于以前的方法。此外,我们在此数据中发现了许多先前未识别的结合位点,并提供了证据表明它们与实际的调控元素相对应。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号