首页> 外文OA文献 >Homology to peptide pattern for annotation of carbohydrate-active enzymes and prediction of function
【2h】

Homology to peptide pattern for annotation of carbohydrate-active enzymes and prediction of function

机译:用于注释碳水化合物活性酶的肽模式的同源性和功能预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Background: Carbohydrate-active enzymes are found in all organisms and participate in key biological processes.These enzymes are classified in 274 families in the CAZy database but the sequence diversity within each family makes it a major task to identify new family members and to provide basis for prediction of enzyme function. A fastand reliable method for de novo annotation of genes encoding carbohydrate-active enzymes is to identify conserved peptides in the curated enzyme families followed by matching of the conserved peptides to the sequence of interestas demonstrated for the glycosyl hydrolase and the lytic polysaccharide monooxygenase families. This approach notonly assigns the enzymes to families but also provides functional prediction of the enzymes with high accuracy.Results: We identified conserved peptides for all enzyme families in the CAZy database with Peptide Pattern Recognition. The conserved peptides were matched to protein sequence for de novo annotation and functional prediction of carbohydrate-active enzymes with the Hotpep method. Annotation of protein sequences from 12 bacterial and 16 fungal genomes to families with Hotpep had an accuracy of 0.84 (measured as F1-score) compared to semiautomatic annotation by the CAZy database whereas the dbCAN HMM-based method had an accuracy of 0.77 with optimized parameters. Furthermore, Hotpep provided a functional prediction with 86% accuracy for the annotated genes. Hotpep is available as a stand-alone application for MS Windows. Conclusions: Hotpep is a state-of-the-art method for automatic annotation and functional prediction of carbohydrate-active enzymes.
机译:背景:碳水化合物活性酶存在于所有生物中,并参与关键的生物过程。这些酶在CAZy数据库中分类为274个家族,但每个家族的序列多样性使其成为鉴定新家族成员并提供依据的主要任务。用于预测酶的功能。一种从头开始注释编码碳水化合物活性酶的基因的快速可靠的方法是,在已整理的酶家族中鉴定保守的肽,然后将保守的肽与糖基水解酶和裂解多糖单加氧酶家族证明的目标序列进行匹配。该方法不仅将酶分配给家族,而且还提供了对酶的高精度功能预测。结果:我们通过肽模式识别在CAZy数据库中鉴定了所有酶家族的保守肽。通过Hotpep方法,将保守的肽与蛋白质序列进行匹配,以进行从头注释和碳水化合物活性酶的功能预测。与通过CAZy数据库进行的半自动注释相比,使用Hotpep从12个细菌和16个真菌基因组到家族的蛋白质序列注释的准确性为0.84(以F1分数衡量),而基于dbCAN HMM的方法使用优化参数的准确性为0.77 。此外,Hotpep提供了对带注释基因的功能预测,准确性为86%。 Hotpep可作为MS Windows的独立应用程序使用。结论:Hotpep是一种用于碳水化合物活性酶的自动注释和功能预测的最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号