首页> 外文会议> >Terminator Detection by Support Vector Machine Utilizing a Stochastic Context-Free Grammar
【24h】

Terminator Detection by Support Vector Machine Utilizing a Stochastic Context-Free Grammar

机译:支持向量机使用随机上下文无关文法的终止子检测

获取原文

摘要

A 2-stage detector was designed to find rho-independent transcription terminators in the Escherichia coli genome. The detector includes a stochastic context free grammar (SCFG) component and a support vector machine (SVM) component. To find terminators, the SCFG searches the intergenic regions of nucleotide sequence for local matches to a terminator grammar that was designed and trained utilizing examples of known terminators. The grammar selects sequences that are the best candidates for terminators and assigns them a prefix, stem-loop, suffix structure using the Cocke-Younger-Kasaami (CYK) algorithm, modified to incorporate energy effects of base pairing. The parameters from this inferred structure are passed to the SVM classifier, which distinguishes terminators from non-terminators that score high according to the terminator grammar. The SVM was trained with negative examples drawn from intergenic sequences that include both featureless and RNA gene regions (which were assigned prefix, stem-loop, suffix structure by the SCFG), so that it successfully distinguishes terminators from either of these. The classifier was found to be 96.4% successful during testing
机译:旨在在大肠杆菌基因组中找到一个2级探测器,在大肠杆菌基因组中找到RHO无关的转录终止子。检测器包括随机上下文自由语法(SCFG)部件和支撑载体机(SVM)组件。为了找到终结器,SCFG将核苷酸序列的核苷酸序列的亚核区域用于利用已知终止子的示例设计和训练的终止者语法。语法选择是终端主者最佳候选者的序列,并使用Cocke-ysther-Kasaami(Cyk)算法将它们分配一个前缀,茎环,后缀结构,修改以结合碱基配对的能量效应。来自该推断结构的参数传递给SVM分类器,其区分来自根据终结语法得分的非终端子的终端者。 SVM培训,由来自非基因序列中的阴性实施例培训,所述非基因序列包括无特征和RNA基因区域(由SCFG分配前缀,茎环,后缀结构),使得它成功区分了这些终端。在测试期间发现分类器成功96.4%

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号