...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >Dimensionality reduction for protein secondary structure and solvent accesibility prediction
【24h】

Dimensionality reduction for protein secondary structure and solvent accesibility prediction

机译:蛋白质二级结构和溶剂可接近性预测的维数减少

获取原文
获取原文并翻译 | 示例

摘要

Secondary structure and solvent accessibility prediction provide valuable information for estimating the three dimensional structure of a protein. As new feature extraction methods are developed the dimensionality of the input feature space increases steadily. Reducing the number of dimensions provides several advantages such as faster model training, faster prediction and noise elimination. In this work, several dimensionality reduction techniques have been employed including various feature selection methods, autoencoders and PCA for protein secondary structure and solvent accessibility prediction. The reduced feature set is used to train a support vector machine at the second stage of a hybrid classifier. Cross-validation experiments on two difficult benchmarks demonstrate that the dimension of the input space can be reduced substantially while maintaining the prediction accuracy. This will enable the incorporation of additional informative features derived for predicting the structural properties of proteins without reducing the accuracy due to overfitting.
机译:二次结构和溶剂可访问性预测提供了用于估计蛋白质的三维结构的有价值的信息。随着新特征提取方法的开发,输入特征空间的二维性稳定增加。减少尺寸的数量提供了若干优点,例如更快的模型训练,更快的预测和噪声消除。在这项工作中,已经采用了几种维度减少技术,包括各种特征选择方法,自身负载剂和用于蛋白质二级结构和溶剂可访问性预测的PCA。减少的特征集用于在混合分级器的第二阶段训练支持向量机。两个困难基准测试的交叉验证实验表明,在保持预测精度的同时,可以基本上减小输入空间的尺寸。这将使衍生的额外信息特征能够掺入预测蛋白质的结构性,而不会降低由于过度装备引起的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号