首页> 外文期刊>The international arab journal of information technology >Improving Classification Performance Using Genetic Programming to Evolve String Kernels
【24h】

Improving Classification Performance Using Genetic Programming to Evolve String Kernels

机译:使用基因编程提高分类性能以演变串核

获取原文
获取原文并翻译 | 示例
           

摘要

The objective of this work is to present a novel evolutionary-based approach that can create and optimize powerful string kernels using Genetic Programming. The proposed model creates and optimizes a superior kernel, which is expressed as a combination of string kernels, their parameters, and corresponding weights. As a proof of concept to demonstrate the feasibility of the presented approach, classification performance of the newly evolved kernel versus a group of conventional single string kernels was evaluated using a challenging classification problem from biology domain known as theclassification of binder and non-binder peptides to Major Histocompatibility Complex Class II. Using 4794 strings containing 3346 binder and 1448 non-binder peptides, the present approach achieved Area Under Curve=0.80, while the 11 tested conventional string kernels have Area Under Curve ranging from 0.59 to 0.75. This significant improvement of the optimized evolved kernel over all other tested string kernels demonstrates the validity of this approach for enhancing Support Vector Machine classification. The presented approach is not exclusive for biological strings. It can be applied to solve pattern recognition problems for other types of strings as well as natural language processing.
机译:这项工作的目的是介绍一种基于进化的基于进化的方法,可以使用遗传编程创建和优化强大的字符串内核。所提出的模型创建并优化了卓越的内核,其表示为字符串内核,其参数和相应权重的组合。作为概念证明,展示所提出的方法的可行性,使用来自称为粘合剂和非粘合剂肽的生物结构域的具有挑战性的分类问题,评估新进化内核的分类性能与传统单字符串内核进行评估主要的组织相容性复杂类II。使用含有3346个粘合剂和1448个非粘合剂肽的4794个弦,本方法在曲线下实现了面积= 0.80,而11所经过测试的常规串核的曲线具有0.59至0.75的曲线。在所有其他测试的字符串内核上,优化的进化内核的这种显着改进展示了这种方法,用于增强支持向量机分类的方法。所提出的方法不是用于生物弦的排他性。它可以应用于解决其他类型的字符串以及自然语言处理的模式识别问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号