首页> 外文期刊>Nucleic Acids Research >Detection of functional DNA motifs via statistical over-representation
【24h】

Detection of functional DNA motifs via statistical over-representation

机译:通过统计过表达检测功能性DNA基序

获取原文
获取原文并翻译 | 示例
           

摘要

The interaction of proteins with DNA recognition motifs regulates a number of fundamental biological processes, including transcription. To understand these processes, we need to know which motifs are present in a sequence and which factors bind to them. We describe a method to screen a set of DNA sequences against a precompiled library of motifs, and assess which, if any, of the motifs are statistically over- or under-represented in the sequences. Over-represented motifs are good candidates for playing a functional role in the sequences, while under-representation hints that if the motif were present, it would have a harmful dysregulatory effect. We apply our method (implemented as a computer program called Clover) to dopamine-responsive promoters, sequences flanking binding sites for the transcription factor LSF, sequences that direct transcription in muscle and liver, and Drosophila segmentation enhancers. In each case Clover successfully detects motifs known to function in the sequences, and intriguing and testable hypotheses are made concerning additional motifs. Clover compares favorably with an ab initio motif discovery algorithm based on sequence alignment, when the motif library includes only a homolog of the factor that actually regulates the sequences. It also demonstrates superior performance over two contingency table based over-representation methods. In conclusion, Clover has the potential to greatly accelerate characterization of signals that regulate transcription.
机译:蛋白质与DNA识别基序的相互作用调节了许多基本的生物学过程,包括转录。要了解这些过程,我们需要知道序列中存在哪些基序,哪些因子与其结合。我们描述了一种针对预编译的基序文库筛选一组DNA序列的方法,并评估其中的基序在统计学上是否过度或不足。过度代表的基序是在序列中发挥功能性作用的良好候选者,而代表不足的暗示提示,如果存在该基序,则将具有有害的失调作用。我们将我们的方法(作为称为Clover的计算机程序实现)应用于多巴胺反应性启动子,转录因子LSF结合位点侧翼的序列,在肌肉和肝脏中直接转录的序列以及果蝇分割增强剂。在每种情况下,Clover都能成功检测到已知在序列中起作用的基序,并针对其他基序提出了有趣且可检验的假设。当基序库仅包含实际上调节序列的因子的同源物时,Clover与基于序列比对的从头算起基元发现算法相比具有优势。它也展示了优于两种基于列联表的过度表示方法的优越性能。总之,三叶草具有极大地加速调节转录信号表征的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号