首页> 外文会议>International IEEE Conference on Intelligent Systems >Analysis of binary feature mapping rules for promoter recognition in imbalanced DNA sequence datasets using Support Vector Machine
【24h】

Analysis of binary feature mapping rules for promoter recognition in imbalanced DNA sequence datasets using Support Vector Machine

机译:使用支持向量机的不平衡DNA序列数据集启动子识别二元特征映射规则分析

获取原文

摘要

Recognition of specific functionally-important DNA sequence fragments is considered one of the most important problems in bioinformatics. One type of such fragments are promoters, i.e., short regulatory DNA sequences located upstream of a gene. Detection of promoters in DNA sequences is important for successful gene prediction. In this paper, a machine learning method, called Support Vector Machine (SVM), is used for classification of DNA sequences and promoter recognition. For optimal classification, 11 rules for mapping of DNA sequences into binary SVM feature space are analyzed. Classification is performed using a power series kernel function. Kernel parameters are optimized using a modification of the Nelder-Mead (downhill simplex) optimization method. The results of classification for drosophila and human sequence datasets are presented.
机译:识别特定的功能 - 重要的DNA序列片段被认为是生物信息学中最重要的问题之一。一种类型的这种片段是促进剂,即位于基因上游的短调节DNA序列。 DNA序列中启动子的检测对于成功的基因预测是重要的。在本文中,一种称为支持向量机(SVM)的机器学习方法用于分类DNA序列和启动子识别。为了获得最佳分类,分析了11个用于将DNA序列映射到二进制SVM特征空间的规则。使用电源系列内核功能进行分类。使用Nelder-Mead(Downhill Simplex)优化方法的修改进行了优化了内核参数。提出了果蝇和人序数据集的分类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号