首页> 外文期刊>The Journal of Chemical Physics >Prediction of amyloid aggregation rates by machine learning and feature selection
【24h】

Prediction of amyloid aggregation rates by machine learning and feature selection

机译:通过机器学习和特征选择预测淀粉样蛋白聚集速率

获取原文
获取原文并翻译 | 示例
           

摘要

A novel data-based machine learning algorithm for predicting amyloid aggregation rates is reported in this paper. Based on a highly nonlinear projection from 16 intrinsic features of a protein and 4 extrinsic features of the environment to the protein aggregation rate, a feedforward fully connected neural network (FCN) with one hidden layer is trained on a dataset composed of 21 different kinds of amyloid proteins and tested on 4 rest proteins. FCN shows a much better performance than traditional algorithms, such as multivariable linear regression and support vector regression, with an average accuracy higher than 90%. Furthermore, by the correlation analysis and the principal component analysis, seven key features, folding energy, HP patterns for helix, sheet and helices cross membrane, pH, ionic strength, and protein concentration, are shown to constitute a minimum feature set for characterizing the amyloid aggregation kinetics. Published under license by AIP Publishing.
机译:本文报道了一种用于预测淀粉样蛋白聚集速率的基于新的基于数据的机器学习算法。 基于从蛋白质的16个固有特征的高度非线性投影和环境的4个内在特征到蛋白质聚集速率,具有一个隐藏层的前馈完全连接的神经网络(FCN)在由21种不同类型组成的数据集上培训 淀粉样蛋白并在4次疗法上进行测试。 FCN显示出比传统算法更好的性能,例如多变量线性回归和支持向量回归,平均精度高于90%。 此外,通过相关性分析和主成分分析,七个关键特征,折叠能量,用于螺旋,螺旋的HP图案,表格和螺旋横膜,pH,离子强度和蛋白质浓度,构成了用于表征的最小特征 淀粉样蛋白聚集动力学。 通过AIP发布在许可证下发布。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号