首页> 外文OA文献 >A sparse autoencoder-based deep neural network for protein solvent accessibility and contact number prediction
【2h】

A sparse autoencoder-based deep neural network for protein solvent accessibility and contact number prediction

机译:基于稀疏的自动化器基础神经网络,用于蛋白质溶剂可访问性和接触数预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Abstract Background Direct prediction of the three-dimensional (3D) structures of proteins from one-dimensional (1D) sequences is a challenging problem. Significant structural characteristics such as solvent accessibility and contact number are essential for deriving restrains in modeling protein folding and protein 3D structure. Thus, accurately predicting these features is a critical step for 3D protein structure building. Results In this study, we present DeepSacon, a computational method that can effectively predict protein solvent accessibility and contact number by using a deep neural network, which is built based on stacked autoencoder and a dropout method. The results demonstrate that our proposed DeepSacon achieves a significant improvement in the prediction quality compared with the state-of-the-art methods. We obtain 0.70 three-state accuracy for solvent accessibility, 0.33 15-state accuracy and 0.74 Pearson Correlation Coefficient (PCC) for the contact number on the 5729 monomeric soluble globular protein dataset. We also evaluate the performance on the CASP11 benchmark dataset, DeepSacon achieves 0.68 three-state accuracy and 0.69 PCC for solvent accessibility and contact number, respectively. Conclusions We have shown that DeepSacon can reliably predict solvent accessibility and contact number with stacked sparse autoencoder and a dropout approach.
机译:摘要背景从一维(1D)序列中蛋白质三维(3D)结构的直接预测是一个具有挑战性的问题。显着的结构特性,如溶剂可访问性和接触号是必不可少的,用于导出蛋白质折叠和蛋白3D结构的抑制。因此,准确地预测这些特征是3D蛋白质结构建筑的关键步骤。结果在本研究中,我们呈现了DeepSacon,一种计算方法,可以通过使用深度神经网络有效地预测蛋白质溶剂可访问性和接触号,这是基于堆叠的自动控制器和丢弃方法构建的。结果表明,与最先进的方法相比,我们所提出的深度达到预测质量的显着改善。我们获得0.70三种三种精度,用于溶剂可接近性,0.33个15个状态精度和0.74 Pearson相关系数(PCC),用于5729单体可溶性球状蛋白质集上的接触号。我们还评估了CASP11基准数据集的性能,DeepSacon分别实现了0.68个三种精度和0.69 PCC的溶剂可访问性和联系号码。结论我们已经表明,DeepSacon可以可靠地预测堆积稀疏自动化器和辍学方法的溶剂可访问性和接触号。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号