首页> 外文期刊>Journal of Theoretical Biology >Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition
【24h】

Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition

机译:从蛋白质序列中提取特征,改善蛋白质折叠识别的深度极端学习机

获取原文
获取原文并翻译 | 示例
           

摘要

Protein fold recognition is an important problem in bioinformatics to predict three-dimensional structure of a protein. One of the most challenging tasks in protein fold recognition problem is the extraction of efficient features from the amino-acid sequences to obtain better classifiers. In this paper, we have proposed six descriptors to extract features from protein sequences. These descriptors are applied in the first stage of a three-stage framework PCA-DELM-LDA to extract feature vectors from the amino-acid sequences. Principal Component Analysis PCA has been implemented to reduce the number of extracted features. The extracted feature vectors have been used with original features to improve the performance of the Deep Extreme Learning Machine DELM in the second stage. Four new features have been extracted from the second stage and used in the third stage by Linear Discriminant Analysis LDA to classify the instances into 27 folds. The proposed framework is implemented on the independent and combined feature sets in SCOP datasets. The experimental results show that extracted feature vectors in the first stage could improve the performance of DELM in extracting new useful features in second stage. (C) 2017 Elsevier Ltd. All rights reserved.
机译:蛋白质折叠识别是生物信息学中的重要问题,以预测蛋白质的三维结构。蛋白质折叠识别问题中最具挑战性的任务之一是从氨基酸序列中提取有效特征,以获得更好的分类器。在本文中,我们提出了六个描述符以从蛋白质序列中提取特征。这些描述符应用于三级框架PCA-DELM-LDA的第一阶段,以从氨基酸序列中提取特征载体。已经实施了主成分分析PCA以减少提取的功能的数量。提取的特征向量已被用于原始特征,以提高第二阶段深度极端学习机DELM的性能。从第二阶段提取了四个新功能,并通过线性判别分析LDA在第三阶段使用,将该实例分类为27倍。所提出的框架是在SCOP数据集中的独立和组合特征集上实现的。实验结果表明,第一阶段中提取的特征向量可以提高DELM在第二阶段提取新的有用特征方面的性能。 (c)2017 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号