【24h】

Multiple Kernel Learning for Fold Recognition

机译:多个内核学习折叠识别

获取原文

摘要

Fold recognition is a key problem in computational biology that involves classifying protein sharing structural similarities into classes commonly known as "folds". Recently, researchers have developed several efficient kernel based discriminatory methods for fold classification using sequence information. These methods train one-versus-rest binary classifiers using well optimized kernels from different data sources and techniques. Integrating this vast amount of data in the form of kernel matrices is an interesting and challenging problem. The semidefinite positive property of the various kernel matrices makes it attractive to cast the task of learning an optimal weighting of several kernel matrices as a semi-definite programming optimization problem. We experiment with a previously introduced quadratically constrained quadratic optimization problem for kernel integration using 1-norm and 2-norm support vector machines. We integrate state-of-the-art profile-based direct kernels to learn an optimal kernel matrix K*. Our experimental results show a small significant improvement in terms of the classification accuracy across the different fold classes. Our analysis illustrates the strength of two dominating kernels across different fold classes, which suggests the redundant nature of the kernel matrices being combined.
机译:折叠识别是计算生物学中的关键问题,涉及将蛋白质分类分享到通常称为“折叠”的课程中的结构相似之处。最近,研究人员已经开发了几种基于核心的鉴别方法,用于使用序列信息折叠分类。这些方法使用来自不同数据源和技术的良好优化的内核来列车一次与休闲二进制分类器。以内核矩阵的形式集成这一大量数据是一个有趣和具有挑战性的问题。各种内核矩阵的SemideFinite正面属性使得施放了学习几个内核矩阵的最佳加权作为半定编程优化问题的任务。我们使用1-NOM和2-NORM支持向量机进行内核集成进行先前引入的二次限制的二次优化问题。我们整合了最先进的基于型材的直接内核,以学习最佳的内核矩阵k *。我们的实验结果表明,在不同折叠类别的分类精度方面表现出小幅改进。我们的分析说明了跨不同折叠类的两个主导内核的强度,这表明组合的内核矩阵的冗余性质。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号