...
首页> 外文期刊>BMC Bioinformatics >Combining classifiers for improved classification of proteins from sequence or structure
【24h】

Combining classifiers for improved classification of proteins from sequence or structure

机译:组合分类器以改善蛋白质从序列或结构的分类

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background Predicting a protein's structural or functional class from its amino acid sequence or structure is a fundamental problem in computational biology. Recently, there has been considerable interest in using discriminative learning algorithms, in particular support vector machines (SVMs), for classification of proteins. However, because sufficiently many positive examples are required to train such classifiers, all SVM-based methods are hampered by limited coverage. Results In this study, we develop a hybrid machine learning approach for classifying proteins, and we apply the method to the problem of assigning proteins to structural categories based on their sequences or their 3D structures. The method combines a full-coverage but lower accuracy nearest neighbor method with higher accuracy but reduced coverage multiclass SVMs to produce a full coverage classifier with overall improved accuracy. The hybrid approach is based on the simple idea of "punting" from one method to another using a learned threshold. Conclusion In cross-validated experiments on the SCOP hierarchy, the hybrid methods consistently outperform the individual component methods at all levels of coverage. Code and data sets are available at http:/oble.gs.washington.edu/proj/sabretooth
机译:背景技术从其氨基酸序列或结构预测蛋白质的结构或功能类别是计算生物学中的基本问题。最近,人们对使用判别式学习算法(特别是支持向量机(SVM))进行蛋白质分类有着相当大的兴趣。但是,由于需要足够多的积极实例来训练此类分类器,因此所有基于SVM的方法都受到覆盖范围的限制。结果在本研究中,我们开发了一种混合机器学习方法对蛋白质进行分类,并将该方法应用于基于蛋白质序列或3D结构将蛋白质分配给结构类别的问题。该方法将全覆盖但精度较低的最近邻方法与精度较高但覆盖率降低的多类SVM相结合,以产生整体精度得到提高的全覆盖分类器。混合方法基于使用学习的阈值从一种方法“打孔”到另一种方法的简单思想。结论在SCOP层次结构的交叉验证实验中,混合方法在所有覆盖级别上始终优于单个组件方法。代码和数据集可从http:/oble.gs.washington.edu/proj/sabretooth获得

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号