首页> 外文会议>Optimization and Systems Biology >Distinguishing enzymes from non-enzymes via support vector machine
【24h】

Distinguishing enzymes from non-enzymes via support vector machine

机译:通过支持向量机将酶与非酶进行区分

获取原文
获取原文并翻译 | 示例

摘要

With many proteins sequenced, the ability of predicting protein function from sequence is becoming more and more important. Currently, methods for inference of the protein functional annotation are mostly based on identifying a known function protein which is similar to the query protein. However, for the proteins that are dissimilar or only similar to the unknown proteins, these methods will lose effectiveness. In this paper, we propose a new method for distinguishing enzymes from non-enzymes without similarity search. We use conjoint triad feature, secondary-structure content and surface pocket properties to describe 1178 high-resolution proteins, and apply support vector machine approach to assign these described proteins class. With 10-fold cross-validation, the accuracy of predicting functional class of enzymes and non-enzymes is about 85.19%. Moreover,by choosing the 'informative' features, the accuracy can be improved to 86.31%. These results suggest that this newly sequence-based method can be used to discover the other functional class membership of proteins.
机译:通过对许多蛋白质进行测序,从序列预测蛋白质功能的能力变得越来越重要。当前,用于推断蛋白质功能注释的方法主要基于鉴定与查询蛋白质相似的已知功能蛋白质。但是,对于与未知蛋白质不相似或仅相似的蛋白质,这些方法将失去效力。在本文中,我们提出了一种无需相似搜索即可区分酶与非酶的新方法。我们使用联合三联体特征,二级结构含量和表面口袋性质来描述1178个高分辨率蛋白质,并应用支持向量机方法来分配这些描述的蛋白质类别。通过10倍交叉验证,预测酶和非酶功能类别的准确性约为85.19%。此外,通过选择“信息性”功能,可以将准确性提高到86.31%。这些结果表明,这种基于序列的新方法可用于发现蛋白质的其他功能类别成员。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号