首页> 美国卫生研究院文献>other >An Empirical Study of Different Approaches for Protein Classification
【2h】

An Empirical Study of Different Approaches for Protein Classification

机译:不同蛋白质分类方法的实证研究

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Many domains would benefit from reliable and efficient systems for automatic protein classification. An area of particular interest in recent studies on automatic protein classification is the exploration of new methods for extracting features from a protein that work well for specific problems. These methods, however, are not generalizable and have proven useful in only a few domains. Our goal is to evaluate several feature extraction approaches for representing proteins by testing them across multiple datasets. Different types of protein representations are evaluated: those starting from the position specific scoring matrix of the proteins (PSSM), those derived from the amino-acid sequence, two matrix representations, and features taken from the 3D tertiary structure of the protein. We also test new variants of proteins descriptors. We develop our system experimentally by comparing and combining different descriptors taken from the protein representations. Each descriptor is used to train a separate support vector machine (SVM), and the results are combined by sum rule. Some stand-alone descriptors work well on some datasets but not on others. Through fusion, the different descriptors provide a performance that works well across all tested datasets, in some cases performing better than the state-of-the-art.
机译:许多领域将从可靠而有效的自动蛋白质分类系统中受益。在有关自动蛋白质分类的最新研究中,特别感兴趣的领域是探索从蛋白质中提取对某些特定问题有效的特征的新方法。但是,这些方法不能通用化,并且仅在少数几个领域证明是有用的。我们的目标是通过在多个数据集中测试蛋白质来评估代表蛋白质的几种特征提取方法。评价了不同类型的蛋白质表示形式:从蛋白质的位置特定评分矩阵(PSSM)开始的蛋白质表示形式,从氨基酸序列衍生的蛋白质表示形式,两个矩阵表示形式以及从蛋白质的3D三级结构获取的特征。我们还测试了蛋白质描述符的新变体。我们通过比较和组合从蛋白质表示中提取的不同描述符来实验性地开发我们的系统。每个描述符用于训练一个单独的支持向量机(SVM),并且结果通过求和规则进行组合。一些独立的描述符在某些数据集上效果很好,但在其他数据集上效果不好。通过融合,不同的描述符提供的性能可以在所有测试的数据集上很好地工作,在某些情况下,其性能要比最新技术更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号