首页> 外文期刊>DNA research: an international journal for rapid publication of reports on genes and genomes >Proteome-wide prediction of novel DNA/RNA-binding proteins using amino acid composition and periodicity in the hyperthermophilic archaeon Pyrococcus furiosus.
【24h】

Proteome-wide prediction of novel DNA/RNA-binding proteins using amino acid composition and periodicity in the hyperthermophilic archaeon Pyrococcus furiosus.

机译:在超嗜热古生火球菌中使用氨基酸组成和周期性预测蛋白质组范围内的新型DNA / RNA结合蛋白。

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Proteins play a critical role in complex biological systems, yet about half of the proteins in publicly available databases are annotated as functionally unknown. Proteome-wide functional classification using bioinformatics approaches thus is becoming an important method for revealing unknown protein functions. Using the hyperthermophilic archaeon Pyrococcus furiosus as a model species, we used the support vector machine (SVM) method to discriminate DNA/RNA-binding proteins from proteins with other functions, using amino acid composition and periodicities as feature vectors. We defined this value as the composition score (CO) and periodicity score (PD). The P. furiosus proteins were classified into three classes (I-III) on the basis of the two-dimensional correlation analysis of CO score and PD score. As a result, approximately 87% of the functionally known proteins categorized as class I proteins (CO score + PD score > 0.6) were found to be DNA/RNA-binding proteins. Applying the two-dimensional correlation analysis to the 994 hypothetical proteins in P. furiosus, a total of 151 proteins were predicted to be novel DNA/RNA-binding protein candidates. DNA/RNA-binding activities of randomly chosen hypothetical proteins were experimentally verified. Six out of seven candidate proteins in class I possessed DNA/RNA-binding activities, supporting the efficacy of our method.
机译:蛋白质在复杂的生物系统中起着至关重要的作用,但公开数据库中约有一半的蛋白质被标注为功能未知。因此,使用生物信息学方法进行蛋白质组范围的功能分类正成为揭示未知蛋白质功能的重要方法。我们以嗜热古生热球菌为模型物种,使用支持向量机(SVM)方法,以氨基酸组成和周期性为特征载体,将DNA / RNA结合蛋白与具有其他功能的蛋白区分开。我们将此值定义为构图得分(CO)和周期性得分(PD)。根据CO评分和PD评分的二维相关分析,将P. furiosus蛋白分为三类(I-III)。结果,发现分类为I类蛋白质(CO评分+ PD评分> 0.6)的约87%的功能已知蛋白质为DNA / RNA结合蛋白质。将二维相关性分析应用到激烈疟原虫中的994个假设蛋白质上,预计共有151个蛋白质是新型的DNA / RNA结合蛋白候选物。实验验证了随机选择的假设蛋白质的DNA / RNA结合活性。 I类7种候选蛋白质中有6种具有DNA / RNA结合活性,支持了我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号