首页> 外文会议>Annual International Conference on Research in Computational Molecular Biology >Bagging MSA Learning: Enhancing Low-Quality PSSM with Deep Learning for Accurate Protein Structure Property Prediction
【24h】

Bagging MSA Learning: Enhancing Low-Quality PSSM with Deep Learning for Accurate Protein Structure Property Prediction

机译:套袋MSA学习:通过深度学习增强低质量PSSM,以准确预测蛋白质结构性质

获取原文

摘要

Accurate predictions of protein structure properties, e.g. secondary structure and solvent accessibility, are essential in analyzing the structure and function of a protein. PSSM (Position-Specific Scoring Matrix) features are widely used in the structure property prediction. However, some proteins may have low-quality PSSM features due to insufficient homologous sequences, leading to limited prediction accuracy. To address this limitation, we propose an enhancing scheme for PSSM features. We introduce the "Bagging MSA" method to calculate PSSM features used to train our model, and adopt a convolutional network to capture local context features and bidirectional-LSTM for long-term dependencies, and integrate them under an unsupervised framework. Structure property prediction models are then built upon such enhanced PSSM features for more accurate predictions. Empirical evaluation of CB513, CASP11, and CASP12 datasets indicate that our unsupervised enhancing scheme indeed generates more informative PSSM features for structure property prediction.
机译:蛋白质结构特性的准确预测,例如二级结构和溶剂可及性,对于分析蛋白质的结构和功能至关重要。 PSSM(特定位置评分矩阵)功能已广泛用于结构属性预测中。然而,由于同源序列不足,某些蛋白质可能具有低质量的PSSM特征,从而导致有限的预测准确性。为了解决此限制,我们提出了PSSM功能的增强方案。我们引入“ Bagging MSA”方法来计算用于训练模型的PSSM特征,并采用卷积网络来捕获局部上下文特征和双向LSTM以获取长期依赖关系,并将其集成在不受监督的框架下。然后,基于此类增强的PSSM功能构建结构特性预测模型,以进行更准确的预测。对CB513,CASP11和CASP12数据集的经验评估表明,我们的无监督增强方案的确为结构特性预测生成了更多有用的PSSM特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号