首页> 外文OA文献 >A novel hybrid method of β-turn identification in protein using binary logistic regression and neural network
【2h】

A novel hybrid method of β-turn identification in protein using binary logistic regression and neural network

机译:基于二进制逻辑回归和神经网络的蛋白质β-转角鉴定新方法

摘要

From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was initially used for the first time toselect significant sequence parameters in identification of β-turns due to a re-substitution test procedure. Sequence parameters were consisted of 80 amino acid positional occurrences and 20 amino acid percentages in sequence. Among these parameters, the most significant oneswhich were selected by binary logistic regression model, were percentages of Gly, Ser and the occurrence of Asn in position i+2, respectively, in sequence. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model wasthen constructed and fed by the parameters selected by binary logistic regression to build a hybrid predictor. The networks have been trained and tested on a non-homologous dataset of 565 protein chains. With applying a nine fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with results of the other β-turnprediction methods. In conclusion, this study proves that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks lead to the development of more precise models for identifying β-turns in proteins.
机译:从结构和功能的角度来看,β-转角在蛋白质中起重要的生物学作用。在本研究中,已经开发出新颖的两阶段杂交程序来鉴定蛋白质中的β-转角。最初使用二元逻辑回归分析来选择重要的序列参数,以识别由于重新替代测试程序而产生的β-转弯。序列参数由80个氨基酸位置出现和20个氨基酸百分比组成。在这些参数中,由二元逻辑回归模型选择的最重要的参数分别是Gly,Ser的百分比和i + 2位置Asn的出现顺序。这些重要参数对β-转角序列的构成具有最大影响。然后,通过二元逻辑回归选择的参数构建并馈入神经网络模型,以构建混合预测器。该网络已在565条蛋白质链的非同源数据集中进行了培训和测试。通过对数据集应用九折交叉验证测试,该网络达到了74的总体准确性(Qtotal),与其他β预测方法的结果相当。总而言之,这项研究证明了二进制逻辑回归的参数选择能力以及神经网络的预测能力导致了开发更精确的模型来鉴定蛋白质中的β-转角。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号