2+) are crucial for protein function. They participate in enzyme catalysis, play regulat'/> Sequence-based protein-Ca2+binding site prediction using SVM classifier ensemble with random under-sampling
首页> 外文会议>International Conference on Progress in Informatics and Computing >Sequence-based protein-Ca2+binding site prediction using SVM classifier ensemble with random under-sampling
【24h】

Sequence-based protein-Ca2+binding site prediction using SVM classifier ensemble with random under-sampling

机译:基于序列的蛋白质-Ca 2 + 结合位点的预测,采用SVM分类器集成,随机欠采样

获取原文

摘要

Calcium ions (Ca2+) are crucial for protein function. They participate in enzyme catalysis, play regulatory roles, and help maintain protein structure. Accurately recognizing Ca2+-binding sites is of significant importance for protein function analysis. Although much progress has been made, challenges remain, especially in the post-genome era where large volume of proteins without being functional annotated are quickly accumulated. In this study, we design a new ab initio predictor, CaSite, to identify Ca2+-binding residues from protein sequence. CaSite first uses evolutionary information, predicted secondary structure, predicted solvent accessibility, and Jensen-Shannon divergence information to represent each residue sample feature. A mean ensemble classifier constructed based on support vector machines (SVM) from multiple random under-samplings is used as the final prediction model, which is effective for relieving the negative influence of the imbalance phenomenon between positive and negative training samples. Experimental results demonstrate that the proposed CaSite achieves a better prediction performance and outperforms the existing sequence-based predictor, Targets.
机译:钙离子(CA. 2 + )对蛋白质功能至关重要。他们参与酶催化,发挥调节作用,并帮助维持蛋白质结构。准确识别CA. 2 + - 困扰网站对于蛋白质功能分析具有重要意义。虽然已经取得了很大的进展,但仍然存在挑战,特别是在基因组后的时代,其中大量蛋白质不具有功能注释的蛋白质被迅速积累。在这项研究中,我们设计了一个新的AB Initio预测因子,套餐,识别CA 2 + - 粘合来自蛋白质序列的残留物。包装首先使用进化信息,预测的二级结构,预测溶剂可访问性,以及Jensen-Shannon分歧信息来代表每个残留物样本特征。一种基于来自多个随机底次采样的支持向量机(SVM)构造的平均集合分类器用作最终预测模型,这对于缓解正负训练样品之间的不平衡现象的负面影响是有效的。实验结果表明,所提出的外壳实现更好的预测性能,优于现有的基于序列的预测因子目标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号