首页> 外文期刊>BMC Bioinformatics >Comparison of phosphorylation patterns across eukaryotes by discriminative N-gram analysis
【24h】

Comparison of phosphorylation patterns across eukaryotes by discriminative N-gram analysis

机译:通过判别性N-gram分析比较真核生物中的磷酸化模式

获取原文
           

摘要

Background How protein phosphorylation relates to kingdom/phylum divergence is largely unknown and the amino acid residues surrounding the phosphorylation site have profound importance on protein kinase–substrate interactions. Standard motif analysis is not adequate for large scale comparative analysis because each phophopeptide is assigned to a unique motif and perform poorly with the unbalanced nature of the input datasets. Results First the discriminative n-grams of five species from five different kingdom/phyla were identified. A signature with 5540 discriminative n-grams that could be found in other species from the same kingdoms/phyla was created. Using a test data set, the ability of the signature to classify species in their corresponding kingdom/phylum was confirmed using classification methods. Lastly, ortholog proteins among proteins with n-grams were identified in order to determine to what degree was the identity of the detected n-grams a property of phosphosites rather than a consequence of species-specific or kingdom/phylum-specific protein inventory. The motifs were grouped in clusters of equal physico-chemical nature and their distribution was similar between species in the same kingdom/phylum while clear differences were found among species of different kingdom/phylum. For example, the animal-specific top discriminative n-grams contained many basic amino acids and the plant-specific motifs were mainly acidic. Secondary structure prediction methods show that the discriminative n-grams in the majority of the cases lack from a regular secondary structure as on average they had 88?% of random coil compared to 66?% found in the phosphoproteins they were derived from. Conclusions The discriminative n-grams were able to classify organisms in their corresponding kingdom/phylum, they show different patterns among species of different kingdom/phylum and these regions can contribute to evolutionary divergence as they are in disordered regions that can evolve rapidly. The differences found possibly reflect group-specific differences in the kinomes of the different groups of species.
机译:背景技术蛋白质磷酸化与王国/门的分歧之间的关系在很大程度上是未知的,磷酸化位点周围的氨基酸残基对蛋白质激酶与底物的相互作用具有重要意义。标准的基序分析不足以进行大规模的比较分析,因为每个磷酸肽都被分配给一个唯一的基序,并且在输入数据集的不平衡性质下表现不佳。结果首先鉴定出来自五个不同王国/门的五个物种的判别性n-gram。创建了具有5540个判别性n-gram的签名,该签名可以在同一王国/门的其他物种中找到。使用测试数据集,使用分类方法确认了签名对相应王国/门类中的物种进行分类的能力。最后,鉴定具有n-gram的蛋白质中的直系同源蛋白质,以便确定检测到的n-gram的身份在多大程度上具有磷酸位点的特性,而不是物种特异性或王国/门类特异性蛋白质库存的结果。这些基序被分组为具有相同物理化学性质的簇,并且它们在相同王国/门的物种之间的分布相似,而在不同王国/门的物种之间发现明显的差异。例如,动物特有的最高判别性n-gram包含许多碱性氨基酸,而植物特有的基序则主要是酸性的。二级结构预测方法表明,在大多数情况下,判别性n-gram缺少规则的二级结构,因为它们平均具有88%的随机卷曲,而从其衍生的磷蛋白中发现的则为66%。结论判别性n-gram能够对相应王国/门类中的生物进行分类,它们在不同王国/门类的物种中表现出不同的模式,并且这些区域可以在进化迅速的无序区域中促进进化差异。所发现的差异可能反映了不同物种组的动力学组中特定于组的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号