【24h】

Multiple Kernel Learning for Fold Recognition

机译:用于折叠识别的多核学习

获取原文
获取原文并翻译 | 示例

摘要

Fold recognition is a key problem in computational biology that involves classifying protein sharing structural similarities into classes commonly known as "folds". Recently, researchers have developed several efficient kernel based discriminatory methods for fold classification using sequence information. These methods train one-versus-rest binary classifiers using well optimized kernels from different data sources and techniques.rnIntegrating this vast amount of data in the form of kernel matrices is an interesting and challenging problem. The semidefinite positive property of the various kernel matrices makes it attractive to cast the task of learning an optimal weighting of several kernel matrices as a semi-definite programming optimization problem. We experiment with a previously introduced quadratically constrained quadratic optimization problem for kernel integration using 1-norm and 2-norm support vector machines. We integrate state-of-the-art profile-based direct kernels to learn an optimal kernel matrix K*. Our experimental results show a small significant improvement in terms of the classification accuracy across the different fold classes. Our analysis illustrates the strength of two dominating kernels across different fold classes, which suggests the redundant nature of the kernel matrices being combined.
机译:折叠识别是计算生物学中的关键问题,涉及将蛋白质共享结构相似性分类为通常称为“折叠”的类。最近,研究人员开发了几种基于核的高效区分方法,用于使用序列信息进行折叠分类。这些方法使用来自不同数据源和技术的经过优化的内核来训练相对于静止的二进制分类器。以内核矩阵的形式集成大量数据是一个有趣且具有挑战性的问题。各种内核矩阵的半定正性使得将学习几个内核矩阵的最佳权重的任务作为半定规划优化问题变得有吸引力。我们尝试使用1-范数和2-范数支持向量机对内核引入的先前引入的二次约束二次优化问题进行实验。我们集成了基于概要文件的最新直接内核,以学习最佳内核矩阵K *。我们的实验结果表明,不同折页类别的分类准确度有很小的显着提高。我们的分析说明了跨不同折叠类的两个主导内核的强度,这表明了组合的内核矩阵的冗余性质。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号