首页> 外文期刊>Bioinformatics >BioOptimizer: a Bayesian scoring function approach to motif discovery
【24h】

BioOptimizer: a Bayesian scoring function approach to motif discovery

机译:BioOptimizer:贝叶斯评分功能方法,用于发现主题

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Transcription factors (TFs) bind directly to short segments on the genome, often within hundreds to thousands of base pairs upstream of gene transcription start sites, to regulate gene expression. The experimental determination of TFs binding sites is expensive and time-consuming. Many motif-finding programs have been developed, but no program is clearly superior in all situations. Practitioners often find it difficult to judge which of the motifs predicted by these algorithms are more likely to be biologically relevant. Results: We derive a comprehensive scoring function based on a full Bayesian model that can handle unknown site abundance, unknown motif width and two-block motifs with variable-length gaps. An algorithm called BioOptimizer is proposed to optimize this scoring function so as to reduce noise in the motif signal found by any motif-finding program. The accuracy of BioOptimizer, which can be used in conjunction with several existing programs, is shown to be superior to using any of these motif-finding programs alone when evaluated by both simulation studies and application to sets of co-regulated genes in bacteria. In addition, this scoring function formulation enables us to compare objectively different predicted motifs and select the optimal ones, effectively combining the strengths of existing programs.
机译:动机:转录因子(TFs)直接与基因组上的短片段结合,通常在基因转录起始位点上游数百至数千个碱基对内,以调节基因表达。 TFs结合位点的实验确定是昂贵且费时的。已经开发了许多主题查找程序,但是没有一个程序在所有情况下都明显优越。从业人员常常发现很难判断这些算法所预测的哪些基序更可能与生物学相关。结果:基于完整的贝叶斯模型,我们得出了一个综合评分功能,该模型可以处理未知位点丰度,未知基序宽度和具有可变长度缺口的两嵌段基序。提出了一种称为BioOptimizer的算法来优化此评分功能,以减少任何主题查找程序发现的主题信号中的噪声。当通过模拟研究和将其应用于细菌中共同调控的基因组进行评估时,可以将BioOptimizer的准确性(可与多个现有程序结合使用)优于单独使用任何这些主题查找程序。此外,这种评分功能公式使我们能够客观地比较不同的预测主题并选择最佳主题,从而有效地结合了现有程序的优势。

著录项

  • 来源
    《Bioinformatics》 |2004年第10期|p. 1557-1564|共8页
  • 作者

    Shane T. Jensen; Jun S. Liu;

  • 作者单位

    Department of Statistics, Harvard University, Cambridge, MA 02138-2901, USA;

    Department of Statistics, Harvard University, Cambridge, MA 02138-2901, USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物科学;
  • 关键词

  • 入库时间 2022-08-17 23:50:21

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号