首页> 外文期刊>Bioinformatics >Prediction of protein solvent accessibility using fuzzy k-nearest neighbor method
【24h】

Prediction of protein solvent accessibility using fuzzy k-nearest neighbor method

机译:用模糊k近邻法预测蛋白质溶剂的可及性

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: The solvent accessibility of amino acid residues plays an important role in tertiary structure prediction, especially in the absence of significant sequence similarity of a query protein to those with known structures. The prediction of solvent accessibility is less accurate than secondary structure prediction in spite of improvements in recent researches. The k-nearest neighbor method, a simple but powerful classification algorithm, has never been applied to the prediction of solvent accessibility, although it has been used frequently for the classification of biological and medical data. Results: We applied the fuzzy k-nearest neighbor method to the solvent accessibility prediction, using PSI-BLAST profiles as feature vectors, and achieved high prediction accuracies. With leave-one-out cross-validation on the ASTRAL SCOP reference dataset constructed by sequence clustering, our method achieved 64.1 % accuracy for a 3-state (buried/intermediate/exposed) prediction (thresholds of 9% for buried/intermediate and 36% for intermediate/exposed) and 86.7, 82.0, 79.0 and 78.5% accuracies for 2-state (buried/exposed) predictions (thresholds of each 0, 5, 16 and 25% for buried/exposed), respectively. Our method also showed slightly better accuracies than other methods by about 2-5% on the RS126 dataset and a bench-marking dataset with 229 proteins.
机译:动机:氨基酸残基的溶剂可及性在三级结构预测中起着重要作用,尤其是在缺少查询蛋白与那些具有已知结构的蛋白的显着序列相似性的情况下。尽管最近的研究有所改进,但溶剂可及性的预测仍不如二级结构预测准确。 k近邻法是一种简单但功能强大的分类算法,尽管它已被频繁地用于生物学和医学数据的分类,但从未用于预测溶剂的可及性。结果:我们将模糊k最近邻法应用到溶剂可及性预测中,使用PSI-BLAST配置文件作为特征向量,并获得了较高的预测精度。通过对通过序列聚类构建的ASTRAL SCOP参考数据集进行一劳永逸的交叉验证,我们的方法对三态(埋入/中间/暴露)预测的准确性达到了64.1%(埋入/中间的阈值为9%,36的阈值为36%)。 %(中间/暴露)的准确度和86.7、82.0、79.0和78.5%的2状态(掩埋/暴露)预测的准确度(掩埋/暴露的阈值分别为0、5、16和25%)。我们的方法在RS126数据集和具有229种蛋白质的基准分析数据集上也显示出比其他方法更好的准确性,约为2-5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号