首页> 外文期刊>Acta Biotheoretica >Prediction of Protein Secondary Structure Using Feature Selection and Analysis Approach
【24h】

Prediction of Protein Secondary Structure Using Feature Selection and Analysis Approach

机译:基于特征选择和分析方法的蛋白质二级结构预测

获取原文
获取原文并翻译 | 示例
       

摘要

The prediction of the secondary structure of a protein from its amino acid sequence is an important step towards the prediction of its three-dimensional structure. However, the accuracy of ab initio secondary structure prediction from sequence is about 80 % currently, which is still far from satisfactory. In this study, we proposed a novel method that uses binomial distribution to optimize tetrapeptide structural words and increment of diversity with quadratic discriminant to perform prediction for protein three-state secondary structure. A benchmark dataset including 2,640 proteins with sequence identity of less than 25 % was used to train and test the proposed method. The results indicate that overall accuracy of 87.8 % was achieved in secondary structure prediction by using ten-fold cross-validation. Moreover, the accuracy of predicted secondary structures ranges from 84 to 89 % at the level of residue. These results suggest that the feature selection technique can detect the optimized tetrapeptide structural words which affect the accuracy of predicted secondary structures.
机译:从蛋白质的氨基酸序列预测蛋白质的二级结构是朝预测其三维结构迈出的重要一步。但是,目前从序列开始从头开始预测二级结构的准确性约为80%,仍然远远不能令人满意。在这项研究中,我们提出了一种新的方法,该方法使用二项式分布来优化四肽结构词和具有二次判别的多样性增加来预测蛋白质三态二级结构。一个基准数据集包括2,640个蛋白质,其序列同一性小于25%,用于训练和测试该方法。结果表明,通过使用十倍交叉验证,二级结构预测的整体准确性达到了87.8%。此外,预测的二级结构的准确性在残基水平上为84%到89%。这些结果表明,特征选择技术可以检测到优化的四肽结构词,从而影响预测的二级结构的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号