首页> 外文会议>International Conference on Knowledge-Based Intelligent Information and Engineering Systems >Structural Analysis of Promoter Sequences Using Grammar Inference and Support Vector Machine
【24h】

Structural Analysis of Promoter Sequences Using Grammar Inference and Support Vector Machine

机译:语法推理和支持向量机启动子序列的结构分析

获取原文

摘要

Promoters are short regulatory DNA sequences located upstream of a gene. Structural analysis of promoter sequences is important for successful gene prediction. Promoters can be recognized by certain patterns that are conserved within a species, but there are many exceptions which makes the structural analysis of promoters a complex problem. Grammar rules can be used for describing the structure of promoter sequences; however, derivation of such rules is not trivial. In this paper, stochastic L-grammar rules are derived automatically from known drosophila and vertebrate promoter and non-promoter sequences using genetic programming. The fitness of grammar rules is evaluated using a machine learning technique, called Support Vector Machine (SVM). SVM is trained on the known promoter sequences to obtain a discriminating function which serves as a means of evaluating a candidate grammar (a set of rules) by determining the percentage of generated sequences that are classified correctly. The combination of SVM and grammar rule inference can mitigate the lack of structural insight in machine learning approaches such as SVM.
机译:启动子是位于基因上游的短期调节DNA序列。启动子序列的结构分析对于成功的基因预测是重要的。启动子可以通过在物种内保守的某些模式来识别,但是许多例外情况使启动子的结构分析成为复杂的问题。语法规则可用于描述启动子序列的结构;但是,这种规则的推导不是微不足道的。本文使用遗传编程,随机L-语法规则自动从已知的果蝇和脊椎动物启动子和非启动子序列衍生出来。使用机器学习技术进行评估语法规则的适应性,称为支持向量机(SVM)。 SVM培训在已知的启动子序列上,以获得通过确定正确分类的所生成的序列的百分比来评估候选语法(一组规则)的鉴别函数。 SVM和语法规则推理的组合可以减轻机器学习方法(如SVM)的结构洞察力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号