首页> 外文会议>International Conference on Computational Science and Its Applications;ICCSA 2008 >Classification of Ligase Function Based on Multi-parametric Feature Extracted from Protein Sequence
【24h】

Classification of Ligase Function Based on Multi-parametric Feature Extracted from Protein Sequence

机译:基于蛋白质序列提取的多参数特征的连接酶功能分类

获取原文

摘要

One of the important goals of bioinformatics is to classify and predict the functions of proteins that have no sequence homolog of known functions. The purpose of this paper is to classify protein function by using multi-parametric feature, without sequence similarity. Firstly, we propose a method for generating novel features that present various local information of protein sequence based on positively and negatively charged residues. Then, we introduce a process of making optimal feature subset through combination of traditional and novel features extracted from protein sequence. Finally, we classify ligase enzymes by support vector machine (SVM). In experiment, only 375 out of 483 features were selected by feature selection, and the classification accuracy for 4th sub-classes in Enzyme Commission (EC) number is 98.35%. Our results demonstrate that most of novel features are valuable for specific enzyme function classification.
机译:生物信息学的重要目标之一是分类和预测蛋白质的功能,其没有已知功能的序列同源物。本文的目的是通过使用多参数特征来分类蛋白质功能,而无需序列相似度。首先,我们提出了一种基于正极和带负电的残留物产生呈现蛋白质序列各种局部信息的新特征的方法。然后,我们介绍通过从蛋白质序列中提取的传统和新特征的组合来制造最佳特征子集的过程。最后,我们通过支持向量机(SVM)来分类连接酶酶。在实验中,通过特征选择选择了483个特征中的375个,并且酶委员会(EC)编号的第4个子类的分类精度为98.35%。我们的结果表明,大多数新功能对于特定的酶功能分类是有价值的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号