A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms

Stephen J Goodswen; Paul J Kennedy; John T Ellis

首页> 外文期刊>BMC Bioinformatics >A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms

【24h】

A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms

机译：一种使用机器学习算法对真核病原体计算机疫苗发现管道中的输出进行分类的新策略

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Background An in silico vaccine discovery pipeline for eukaryotic pathogens typically consists of several computational tools to predict protein characteristics. The aim of the in silico approach to discovering subunit vaccines is to use predicted characteristics to identify proteins which are worthy of laboratory investigation. A major challenge is that these predictions are inherent with hidden inaccuracies and contradictions. This study focuses on how to reduce the number of false candidates using machine learning algorithms rather than relying on expensive laboratory validation. Proteins from Toxoplasma gondii, Plasmodium sp., and Caenorhabditis elegans were used as training and test datasets. Results The results show that machine learning algorithms can effectively distinguish expected true from expected false vaccine candidates (with an average sensitivity and specificity of 0.97 and 0.98 respectively), for proteins observed to induce immune responses experimentally. Conclusions Vaccine candidates from an in silico approach can only be truly validated in a laboratory. Given any in silico output and appropriate training data, the number of false candidates allocated for validation can be dramatically reduced using a pool of machine learning algorithms. This will ultimately save time and money in the laboratory.

机译：背景技术用于真核病原体的计算机疫苗发现管线通常由几种预测蛋白质特征的计算工具组成。计算机模拟方法发现亚单位疫苗的目的是使用预测的特征来识别值得实验室研究的蛋白质。一个主要的挑战是，这些预测是隐藏的不准确性和矛盾所固有的。这项研究的重点是如何使用机器学习算法减少虚假候选人的数量，而不是依靠昂贵的实验室验证。来自弓形虫，疟原虫和秀丽隐杆线虫的蛋白质被用作训练和测试数据集。结果结果表明，机器学习算法可以有效区分预期的真假疫苗和预期的假疫苗候选物（平均敏感性和特异性分别为0.97和0.98），用于观察到实验诱导免疫反应的蛋白质。结论通过计算机方法获得的疫苗候选者只能在实验室中进行真正的验证。给定任何计算机输出和适当的培训数据，可以使用一组机器学习算法来显着减少分配给验证的错误候选者的数量。最终将节省实验室的时间和金钱。

著录项

来源
《BMC Bioinformatics》 |2013年第1期|共页
作者
Stephen J Goodswen; Paul J Kennedy; John T Ellis;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类生物科学;
关键词

相似文献

外文文献
中文文献
专利

1. Vacceed: a high-throughput in silico vaccine candidate discovery pipeline for eukaryotic pathogens based on reverse vaccinology [J] . Goodswen SJ, Kennedy PJ, Ellis JT Bioinformatics . 2014,第16期

机译：疫苗接种：基于反向疫苗学的真核生物病原体高通量计算机疫苗候选者发现渠道
2. A guide to in silico vaccine discovery for eukaryotic pathogens [J] . Goodswen S.J., Kennedy P.J., Ellis J.T. Briefings in bioinformatics . 2013,第6期

机译：真核生物病原体计算机疫苗发现指南
3. A comparative study of machine learning and deep learning algorithms to classify cancer types based on microarray gene expression data [J] . Reinel Tabares-Soto, Simon Orozco-Arias, Victor Romero-Cano, PeerJ Computer Science . 2020,第1期

机译：基于微阵列基因表达数据的机器学习和深度学习算法对癌症类型的比较研究
4. Pipelining machine learning algorithms for knowledge discovery [C] . Allan L. Egbert, Jr., Florida State Univ., Conference on applications and science of computational intelligence . 2000

机译：流水线式机器学习算法以进行知识发现
5. Exploring the Use of Supervised Machine Learning Algorithms to Classify Simulated Balance Deficits [D] . Sidener, Logan James. 2018

机译：探索使用监督机器学习算法来分类模拟余额赤字
6. A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms [O] . Stephen J Goodswen, Paul J Kennedy, John T Ellis 2013

机译：一种使用机器学习算法对真核病原体计算机疫苗发现管道中的输出进行分类的新策略
7. A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms [O] . Stephen J Goodswen, Paul J Kennedy, John T Ellis 2013

机译：一种使用机器学习算法对真核病原体计算机疫苗发现管道中的输出进行分类的新策略

A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms

摘要

著录项

相似文献

相关主题

期刊订阅