首页> 外文期刊>Scientific reports. >Deeper Profiles and Cascaded Recurrent and Convolutional Neural Networks for state-of-the-art Protein Secondary Structure Prediction
【24h】

Deeper Profiles and Cascaded Recurrent and Convolutional Neural Networks for state-of-the-art Protein Secondary Structure Prediction

机译:更深的曲线和级联反复化和卷积神经网络,用于最先进的蛋白质二级结构预测

获取原文
           

摘要

Protein Secondary Structure prediction has been a central topic of research in Bioinformatics for decades. In spite of this, even the most sophisticated ab initio SS predictors are not able to reach the theoretical limit of three-state prediction accuracy (88-90%), while only a few predict more than the 3 traditional Helix, Strand and Coil classes. In this study we present tests on different models trained both on single sequence and evolutionary profile-based inputs and develop a new state-of-the-art system with Porter 5. Porter 5 is composed of ensembles of cascaded Bidirectional Recurrent Neural Networks and Convolutional Neural Networks, incorporates new input encoding techniques and is trained on a large set of protein structures. Porter 5 achieves 84% accuracy (81% SOV) when tested on 3 classes and 73% accuracy (70% SOV) on 8 classes on a large independent set. In our tests Porter 5 is 2% more accurate than its previous version and outperforms or matches the most recent predictors of secondary structure we tested. When Porter 5 is retrained on SCOPe based sets that eliminate homology between training/testing samples we obtain similar results. Porter is available as a web server and standalone program at http://distilldeep.ucd.ie/porter/ alongside all the datasets and alignments.
机译:几十年来,蛋白质二级结构预测是生物信息学研究的核心主题。尽管如此,即使是最复杂的AB Initio SS预测器也无法达到三态预测精度的理论极限(88-90%),而只有少数预测超过3个传统螺旋,股线和线圈课程。在这项研究中,我们在单个序列和基于进化的简档输入上培训的不同模型的测试以及开发了具有Porter 5. Porter 5的新型系统由级联双向经常性神经网络和卷积的集合组成神经网络,包括新的输入编码技术,并在大量的蛋白质结构上培训。搬运工5在大型独立集合上的3个类和73%的精度(70%SOV)上测试时,达到84%的精度(81%SOV)。在我们的测试中,Porter 5比以前的版本和优于我们测试的二级结构的最新预测器更准确,或者匹配的比例更准确。当在基于范围的集合上被检测到培训/测试样本之间消除同源的组时,我们获得类似的结果。 Porter可在http://distilldeep.ucd.ie/porter/旁边作为Web服务器和独立程序提供,并沿所有数据集和对齐方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号