Journal: BMC Bioinformatics

Classification of protein quaternary structure by functional domain composition

Abstract

Background: The number and arrangement of the subunits that form a protein are referred to as its quaternary structure. Quaternary structure is an important protein attribute that is closely related to function. Proteins with quaternary structure are called oligomeric proteins, and they are involved in various biological processes, such as metabolism, signal transduction, and chromosome replication. It is therefore highly desirable to develop computational methods that automatically classify the quaternary structure of proteins from their sequences.

Results: To explore this problem, we adopted an approach based on the functional domain composition of proteins. Every protein was represented by a vector calculated from the domains in the PFAM database. The nearest neighbor algorithm (NNA) was used to classify the quaternary structure of proteins from this information. The jackknife cross-validation test was performed on a non-redundant protein dataset in which the sequence identity was less than 25%. The overall success rate obtained was 75.17%. Additionally, to demonstrate the effectiveness of this method, we predicted the proteins in an independent dataset and achieved an overall success rate of 84.11%.

Conclusion: The results indicate that, compared with the amino acid composition method and BLAST, the domain composition approach may be a more effective and promising high-throughput method for dealing with this complicated problem in bioinformatics.
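The pipeline described in the Results (domain-composition vector, nearest neighbor classification, jackknife test) can be illustrated with a minimal Python sketch. It rests on assumptions that the abstract does not confirm: the vector is taken to be binary (1 if a domain is present), the distance is taken to be one minus cosine similarity, and the domain identifiers and class labels are toy placeholders, not the paper's data.

```python
import math

def domain_vector(domains, vocabulary):
    """Binary vector: 1.0 if the protein contains the domain, else 0.0 (assumed encoding)."""
    present = set(domains)
    return [1.0 if d in present else 0.0 for d in vocabulary]

def cosine_distance(x, y):
    """One minus cosine similarity; an assumed distance, not confirmed by the abstract."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    if norm_x == 0 or norm_y == 0:
        return 1.0  # a protein with no known domain is treated as maximally distant
    return 1.0 - dot / (norm_x * norm_y)

def nearest_neighbor_label(query, training):
    """Return the label of the training vector closest to the query."""
    best = min(training, key=lambda item: cosine_distance(query, item[0]))
    return best[1]

def jackknife_accuracy(samples):
    """Leave-one-out test: predict each sample from all the others."""
    correct = 0
    for i, (vec, label) in enumerate(samples):
        rest = samples[:i] + samples[i + 1:]
        if nearest_neighbor_label(vec, rest) == label:
            correct += 1
    return correct / len(samples)

# Toy placeholders (hypothetical): domain identifiers and quaternary-structure labels.
VOCAB = ["PF00001", "PF00002", "PF00003", "PF00004"]
TOY = [
    (["PF00001", "PF00002"], "monomer"),
    (["PF00001"], "monomer"),
    (["PF00003", "PF00004"], "homodimer"),
    (["PF00003"], "homodimer"),
]
SAMPLES = [(domain_vector(doms, VOCAB), label) for doms, label in TOY]

if __name__ == "__main__":
    print("jackknife success rate:", jackknife_accuracy(SAMPLES))
```

In the jackknife test each protein is removed in turn and classified from the remaining ones, so the reported success rate is never measured on a protein that the classifier has already seen.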
