Building multiclass classifiers for remote homology detection and fold recognition

Huzefa Rangwala; George Karypis

Abstract

Background: Protein remote homology detection and fold recognition are central problems in computational biology. Supervised learning algorithms based on support vector machines are currently one of the most effective methods for solving these problems. These methods are primarily used to solve binary classification problems and they have not been extensively used to solve the more general multiclass remote homology prediction and fold recognition problems.

Results: We present a comprehensive evaluation of a number of methods for building SVM-based multiclass classification schemes in the context of the SCOP protein classification. These methods include schemes that directly build an SVM-based multiclass model, schemes that employ a second-level learning approach to combine the predictions generated by a set of binary SVM-based classifiers, and schemes that build and combine binary classifiers for various levels of the SCOP hierarchy beyond those defining the target classes.

Conclusion: Analyzing the performance achieved by the different approaches on four different datasets we show that most of the proposed multiclass SVM-based classification approaches are quite effective in solving the remote homology prediction and fold recognition problems and that the schemes that use predictions from binary models constructed for ancestral categories within the SCOP hierarchy tend to not only lead to lower error rates but also reduce the number of errors in which a superfamily is assigned to an entirely different fold and a fold is predicted as being from a different SCOP class. Our results also show that the limited size of the training data makes it hard to learn complex second-level models, and that models of moderate complexity lead to consistently better results.
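
The second-level idea mentioned in the abstract (learning how to combine the outputs of a set of binary classifiers) can be illustrated with a small toy sketch. This is not the authors' implementation: the per-class scale-and-shift adjustment, the perceptron-style update, the function names, and the score values are all assumptions chosen for illustration, and the scores are made-up stand-ins for one-vs-rest SVM outputs.

```python
# Toy sketch of a second-level combiner over one-vs-rest scores.
# Assumption: each row holds hypothetical binary-classifier outputs,
# one score per class; no real SVM or SCOP data is involved.

def predict(scores, scale, shift):
    """Pick the class whose adjusted score scale[k] * s[k] + shift[k] is largest."""
    adjusted = [w * s + b for w, s, b in zip(scale, scores, shift)]
    return max(range(len(adjusted)), key=adjusted.__getitem__)


def fit_second_level(score_rows, labels, epochs=50, lr=0.1):
    """Learn per-class scale and shift with a perceptron-style update."""
    n_classes = len(score_rows[0])
    scale = [1.0] * n_classes
    shift = [0.0] * n_classes
    for _ in range(epochs):
        mistakes = 0
        for scores, y in zip(score_rows, labels):
            p = predict(scores, scale, shift)
            if p != y:
                mistakes += 1
                # Raise the adjusted score of the true class ...
                scale[y] += lr * scores[y]
                shift[y] += lr
                # ... and lower that of the wrongly chosen class.
                scale[p] -= lr * scores[p]
                shift[p] -= lr
        if mistakes == 0:
            break  # every training row is already classified correctly
    return scale, shift


# Made-up scores in which the classifier for class 1 is inflated.
score_rows = [
    [0.6, 0.8, -0.7],
    [-0.6, 0.9, -0.5],
    [-0.7, 0.7, 0.5],
    [0.8, 0.1, -0.6],
    [-0.5, 0.2, 0.9],
    [-0.4, 1.0, -0.3],
]
labels = [0, 1, 2, 0, 2, 1]

identity = ([1.0, 1.0, 1.0], [0.0, 0.0, 0.0])
raw_correct = sum(predict(s, *identity) == y for s, y in zip(score_rows, labels))

scale, shift = fit_second_level(score_rows, labels)
fitted_correct = sum(predict(s, scale, shift) == y for s, y in zip(score_rows, labels))

print("correct with raw argmax:", raw_correct, "of", len(labels))
print("correct after second-level fit:", fitted_correct, "of", len(labels))
```

Taking the raw maximum over binary outputs implicitly assumes the scores are comparable across classifiers. A second-level model relaxes that assumption at the cost of extra parameters, which is consistent with the abstract's remark that limited training data favors second-level models of moderate complexity.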

Bibliographic details

Source: BMC Bioinformatics, 2006, Issue 1
Authors: Huzefa Rangwala; George Karypis
Original format: PDF
CLC classification: Biological sciences

Similar literature

1. Remote protein homology detection and fold recognition using two-layer support vector machine classifiers [J]. Muda HM, Saad P, Othman RM. Computers in Biology and Medicine, 2011, Issue 8.
2. Protein Remote Homology Detection and Fold Recognition Based on Sequence-Order Frequency Matrix [J]. Liu Bin, Chen Junjie, Guo Mingyue. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2019, Issue 1.
3. Scalable remote homology detection and fold recognition in massive protein networks [J]. Petegrosso Raphael, Li Zhuliu, Srour Molly A. Proteins: Structure, Function, and Genetics, 2019, Issue 6.
4. Protein Fold Recognition and Remote Homology Detection Based on Profile-Level Building Blocks [C]. Lin Lei, Shen Yi, Liu Bin. 2010 International Conference on Biomedical Engineering and Computer Science, 2010.
5. Application of a Hidden Bayes Naive Multiclass Classifier in Network Intrusion Detection [D]. Koc, Levent. 2013.
6. Building multiclass classifiers for remote homology detection and fold recognition [O]. Huzefa Rangwala, George Karypis. 2006.
7. Building multiclass classifiers for remote homology detection and fold recognition [O]. Huzefa Rangwala, George Karypis. 2012.
8. Building Multiclass Classifiers for Remote Homology Detection and Fold Recognition [R]. Rangwala, H., Karypis, G. 2006.
