首页> 外文会议>Asian conference on intelligent information and database systems >Ensemble of Diversely Trained Support Vector Machines for Protein Fold Recognition
【24h】

Ensemble of Diversely Trained Support Vector Machines for Protein Fold Recognition

机译:用于蛋白折叠识别的多个培训的支持向量机的集合

获取原文
获取外文期刊封面目录资料

摘要

Protein Fold Recognition (PFR) is defined as assigning a given protein to a fold based on its major secondary structure. PFR is considered as an important step toward protein structure prediction and drug design. However, it still remains as an unsolved problem for biological science and bioinformatics. In this study, we explore the impact of two novel feature extraction methods namely overlapped segmented distribution and overlapped segmented autocorrelation to provide more local discriminatory information for the PFR compared to previously proposed methods found in the literature. We study the impact of our proposed feature extraction methods using 15 promising physicochemical attributes of the amino acids. Afterwards, by proposing an ensemble Support Vector Machines (SVM) which are diversely trained using features extracted from different physicochemical-based attributes, we enhance the protein fold prediction accuracy for up to 5% better than similar studies found in the literature.
机译:蛋白质折叠识别(PFR)定义为基于其主要二级结构将给定蛋白分配给折叠。 PFR被认为是达到蛋白质结构预测和药物设计的重要一步。然而,它仍然是生物科学和生物信息学的未解决问题。在这项研究中,我们探讨了两种新特征提取方法的影响即重叠的分段分布和重叠的分段自相关,以便与文献中的先前提出的方法相比,为PFR提供更多局部歧视信息。我们研究了我们所提出的特征提取方法的影响,使用氨基酸的15个有希望的物理化学属性研究。之后,通过提出使用从基于物理化学的属性中提取的特征来提取的集合支持向量机(SVM),我们提高蛋白质折叠预测精度,比文献中发现的类似研究更好地提高5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号