首页> 外文期刊>Bioinformatics >Accurate prediction of bacterial type IV secreted effectors using amino acid composition and PSSM profiles
【24h】

Accurate prediction of bacterial type IV secreted effectors using amino acid composition and PSSM profiles

机译:使用氨基酸组成和PSSM谱准确预测IV型细菌分泌的效应子

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: Various human pathogens secret effector proteins into hosts cells via the type IV secretion system (T4SS). These proteins play important roles in the interaction between bacteria and hosts. Computational methods for T4SS effector prediction have been developed for screening experimental targets in several isolated bacterial species; however, widely applicable prediction approaches are still unavailable Results: In this work, four types of distinctive features, namely, amino acid composition, dipeptide composition, .position-specific scoring matrix composition and auto covariance transformation of position-specific scoring matrix, were calculated from primary sequences. A classifier, T4EffPred, was developed using the support vector machine with these features and their different combinations for effector prediction. Various theoretical tests were performed in a newly established dataset, and the results were measured with four indexes. We demonstrated that T4EffPred can discriminate IVA and IVB effectors in benchmark datasets with positive rates of 76.7% and 89.7%, respectively. The overall accuracy of 95.9% shows that the present method is accurate for distinguishing the T4SS effector in unidentified sequences. A classifier ensemble was designed to synthesize all single classifiers. Notable performance improvement was observed using this ensemble system in benchmark tests. To demonstrate the model's application, a genome-scale prediction of effectors was performed in Bartonella henselae, an important zoonotic pathogen. A number of putative candidates were distinguished.
机译:动机:各种人类病原体通过IV型分泌系统(T4SS)将效应蛋白分泌到宿主细胞中。这些蛋白质在细菌和宿主之间的相互作用中起重要作用。已经开发出用于预测T4SS效应子的计算方法,用于筛选几种分离的细菌物种中的实验目标。然而,仍然无法获得广泛适用的预测方法。结果:在这项工作中,计算了四种类型的显着特征,即氨基酸组成,二肽组成,位置特异性得分矩阵组成和位置特异性得分矩阵的自协方差转化来自主要序列。使用支持向量机开发了分类器T4EffPred,这些分类器具有这些功能及其不同的组合,可用于效应子预测。在一个新建立的数据集中进行了各种理论测试,并使用四个指标测量了结果。我们证明了T4EffPred可以区分基准数据集中的IVA和IVB效应子,阳性率分别为76.7%和89.7%。 95.9%的总体准确性表明,本方法可准确区分未鉴定序列中的T4SS效应子。分类器集合被设计为合成所有单个分类器。使用该集成系统在基准测试中观察到了显着的性能改进。为了证明该模型的应用,在重要的人畜共患病病原体巴尔通体中进行了效应子的基因组规模预测。区分了许多假定的候选人。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号