首页> 美国卫生研究院文献>other >Recognition of 27-Class Protein Folds by Adding the Interaction of Segments and Motif Information
【2h】

Recognition of 27-Class Protein Folds by Adding the Interaction of Segments and Motif Information

机译:通过添加片段和基序信息的相互作用识别27类蛋白质折叠。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The recognition of protein folds is an important step for the prediction of protein structure and function. After the recognition of 27-class protein folds in 2001 by Ding and Dubchak, prediction algorithms, prediction parameters, and new datasets for the prediction of protein folds have been improved. However, the influences of interactions from predicted secondary structure segments and motif information on protein folding have not been considered. Therefore, the recognition of 27-class protein folds with the interaction of segments and motif information is very important. Based on the 27-class folds dataset built by Liu et al., amino acid composition, the interactions of secondary structure segments, motif frequency, and predicted secondary structure information were extracted. Using the Random Forest algorithm and the ensemble classification strategy, 27-class protein folds and corresponding structural classification were identified by independent test. The overall accuracy of the testing set and structural classification measured up to 78.38% and 92.55%, respectively. When the training set and testing set were combined, the overall accuracy by 5-fold cross validation was 81.16%. In order to compare with the results of previous researchers, the method above was tested on Ding and Dubchak's dataset which has been widely used by many previous researchers, and an improved overall accuracy 70.24% was obtained.
机译:蛋白质折叠的识别是预测蛋白质结构和功能的重要步骤。在2001年Ding和Dubchak识别出27种蛋白质折叠后,预测算法,预测参数和用于预测蛋白质折叠的新数据集得到了改进。但是,尚未考虑到来自预测的二级结构片段和基序信息的相互作用对蛋白质折叠的影响。因此,通过节段和基序信息的相互作用识别27类蛋白质折叠非常重要。根据Liu等人建立的27类折叠数据集,提取了氨基酸组成,二级结构片段,基序频率和预测的二级结构信息之间的相互作用。采用随机森林算法和集成分类策略,通过独立测试确定了27类蛋白质折叠和相应的结构分类。测试仪和结构分类的整体准确度分别达到78.38%和92.55%。当训练集和测试集结合在一起时,通过5倍交叉验证的总体准确性为81.16%。为了与以前的研究人员的结果进行比较,上述方法在Ding和Dubchak的数据集上进行了测试,该数据集已被许多先前的研究人员广泛使用,并且获得了70.24%的整体准确率。

著录项

  • 期刊名称 other
  • 作者

    Zhenxing Feng; Xiuzhen Hu;

  • 作者单位
  • 年(卷),期 -1(2014),-1
  • 年度 -1
  • 页码 262850
  • 总页数 9
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号