首页> 外文会议>KDD-09 workshop on statistical and relational learning in bioinformatics 2009 >Multi-Class Protein Fold Recognition using Large Margin Logic based Divide and Conquer Learning
【24h】

Multi-Class Protein Fold Recognition using Large Margin Logic based Divide and Conquer Learning

机译:使用基于大幅度逻辑的分而治之学习的多类蛋白质折叠识别

获取原文
获取原文并翻译 | 示例

摘要

Inductive Logic Programming (ILP) systems have been successfully applied to solve complex biological problem by vicwing them as binary classification tasks. It remains an open question how an accurate solution to a multi-class problem can be obtained by using a logic based learning method. In this paper we present a novel logic based approach to solve complex and challenging multi-class classification problems in bioinformatics by focusing on a particular task, namely protein fold recognition. Our technique is based on the use of large margin kernel-based methods in conjunction with first order rules induced by an ILP system. The proposed approach learns a multi-class classifier by using a divide and conquer reduction strategy that splits multi-classes into binary groups and solves each individual problem recursively hence generating an underlying decision list structure. The method is applied to assigning protein domains to folds. Experimental evaluation of the method demonstrates the efficacy of the proposed approach to solving complex multi-class classification problems in bioinformatics.
机译:通过归纳为二进制分类任务,归纳逻辑编程(ILP)系统已成功应用于解决复杂的生物学问题。仍然存在一个悬而未决的问题,即如何通过使用基于逻辑的学习方法来获得多类问题的准确解决方案。在本文中,我们提出了一种基于逻辑的新颖方法,通过专注于特定任务(即蛋白质折叠识别)来解决生物信息学中复杂且具有挑战性的多类分类问题。我们的技术基于结合基于大容限核的方法和ILP系统产生的一阶规则的使用。所提出的方法通过使用分而治之的减少策略来学习多类分类器,该策略将多类分为二进制组并递归解决每个单独的问题,从而生成基础决策列表结构。该方法适用于将蛋白质结构域分配给折叠。该方法的实验评估证明了该方法解决生物信息学中复杂的多类分类问题的功效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号