首页> 外文会议>Asian conference on intelligent information and database systems >Ensemble of Diversely Trained Support Vector Machines for Protein Fold Recognition
【24h】

Ensemble of Diversely Trained Support Vector Machines for Protein Fold Recognition

机译:用于蛋白质折叠识别的受过训练的支持向量机的集合

获取原文

摘要

Protein Fold Recognition (PFR) is defined as assigning a given protein to a fold based on its major secondary structure. PFR is considered as an important step toward protein structure prediction and drug design. However, it still remains as an unsolved problem for biological science and bioinformatics. In this study, we explore the impact of two novel feature extraction methods namely overlapped segmented distribution and overlapped segmented autocorrelation to provide more local discriminatory information for the PFR compared to previously proposed methods found in the literature. We study the impact of our proposed feature extraction methods using 15 promising physicochemical attributes of the amino acids. Afterwards, by proposing an ensemble Support Vector Machines (SVM) which are diversely trained using features extracted from different physicochemical-based attributes, we enhance the protein fold prediction accuracy for up to 5% better than similar studies found in the literature.
机译:蛋白质折叠识别(PFR)定义为根据主要的二级结构将给定的蛋白质分配给折叠。 PFR被认为是迈向蛋白质结构预测和药物设计的重要一步。然而,对于生物科学和生物信息学来说,它仍然是一个未解决的问题。在这项研究中,我们探索了两种新颖的特征提取方法的影响,即重叠分段分布和重叠分段自相关,以提供与文献中先前提出的方法相比更多的PFR本地鉴别信息。我们使用氨基酸的15种有希望的理化特性研究了我们提出的特征提取方法的影响。之后,通过提出一个集合支持向量机(SVM),该向量集使用从基于不同理化属性的特征中提取的特征进行了多种训练,与文献中的类似研究相比,我们将蛋白质折叠预测的准确性提高了5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号