首页> 外文期刊>International Journal of Neural Systems >THE PREDICTION OF BACTERIAL TRANSCRIPTION START SITES USING SVMS
【24h】

THE PREDICTION OF BACTERIAL TRANSCRIPTION START SITES USING SVMS

机译:使用SVMS预测细菌转录起始位点

获取原文
获取原文并翻译 | 示例
           

摘要

Identifying promoters is the key to understanding gene expression in bacteria. Promoters lie in tightly constrained positions relative to the transcription start site (TSS). In this paper, we address the problem of predicting transcription start sites in Escherichia coli. Knowing the TSS position, one can then predict the promoter position to within a few base pairs, and vice versa. The accepted method for promoter prediction is to use a pair of position weight matrices (PWMs), which define conserved motifs at the sigma-factor binding site. However this method is known to result in a large number of false positive predictions, thereby limiting its usefulness to the experimental biologist. We adopt an alternative approach based on the Support Vector Machine (SVM) using a modified mismatch spectrum kernel. Our modifications involve tagging the motifs with their location, and selectively pruning the feature set. We quantify the performance of several SVM models and a PWM model using a performance metric of area under the detection-error tradeoff (DET) curve. SVM models are shown to outperform the PWM on a biologically realistic TSS prediction task. We also describe a more broadly applicable peak scoring technique which reduces the number of false positive predictions, greatly enhancing the utility of our results.
机译:识别启动子是了解细菌中基因表达的关键。启动子位于相对于转录起始位点(TSS)严格限制的位置。在本文中,我们解决了预测大肠杆菌中转录起始位点的问题。知道了TSS的位置后,就可以将启动子的位置预测在几个碱基对之内,反之亦然。启动子预测的公认方法是使用一对位置权重矩阵(PWM),它们在sigma因子结合位点定义保守的基序。然而,已知该方法导致大量的假阳性预测,从而限制了其对实验生物学家的实用性。我们采用一种基于支持向量机(SVM)的替代方法,该方法使用了经过修改的失配频谱内核。我们的修改包括标记主题的位置,并有选择地修剪功能集。我们使用检测误差折衷(DET)曲线下的面积性能指标来量化几种SVM模型和PWM模型的性能。在生物学上现实的TSS预测任务中,SVM模型的性能优于PWM。我们还描述了一种更广泛适用的峰值评分技术,该技术减少了假阳性预测的数量,从而大大提高了结果的实用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号