Journal of Integrative Bioinformatics

Integrated Automatic Workflow for Phylogenetic Tree Analysis Using Public Access and Local Web Services

Abstract

Coding sequences (CDS) continue to be discovered, and larger CDS are being revealed frequently. Approaches and related tools have been developed and upgraded in parallel, especially for phylogenetic tree analysis. This paper proposes an integrated automatic Taverna workflow for phylogenetic tree inference using public access web services at the European Bioinformatics Institute (EMBL-EBI) and the Swiss Institute of Bioinformatics (SIB), together with our own locally deployed web services. The workflow input is a set of CDS in FASTA format. The workflow supports 1,000 to 20,000 bootstrap replicates. It infers trees with the Parsimony (PARS), Distance Matrix - Neighbor Joining (DIST-NJ), and Maximum Likelihood (ML) algorithms of the EMBOSS PHYLIPNEW package, based on our proposed Multiple Sequence Alignment (MSA) similarity score. The local web services are implemented and deployed in two types, using Soaplab2 and Apache Axis2. These are SOAP and Java Web Service (JWS) services that provide WSDL endpoints to Taverna Workbench, a workflow manager. The workflow has been validated, its performance has been measured, and its results have been verified. The workflow's execution time is less than ten minutes for inferring a tree with 10,000 bootstrap replicates. The workflow will benefit bioinformaticians with an intermediate level of knowledge and experience. All local services have been deployed at our portal http://bioservices.sci.psu.ac.th.
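
The bootstrapping step mentioned in the abstract can be illustrated with a minimal sketch. It assumes the standard nonparametric bootstrap for alignments, in which alignment columns are resampled with replacement. The abstract does not say how the workflow implements this step, so the code below is an illustration only, and the names `parse_fasta`, `bootstrap_replicate`, and `bootstrap` are hypothetical rather than part of the workflow or of EMBOSS PHYLIPNEW.

```python
import random
from typing import Dict, List


def parse_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA text into {sequence_id: sequence}.

    The ID is the first whitespace-delimited token after '>'.
    """
    records: Dict[str, List[str]] = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:].split()
            current = header[0] if header else ""
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def bootstrap_replicate(alignment: Dict[str, str],
                        rng: random.Random) -> Dict[str, str]:
    """Build one bootstrap replicate by resampling columns with replacement.

    Assumes the input is already aligned (all sequences of equal length).
    """
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError(
            "alignment must be non-empty with sequences of equal length")
    n_cols = lengths.pop()
    # Draw as many column indices as the alignment has columns.
    cols = [rng.randrange(n_cols) for _ in range(n_cols)]
    return {name: "".join(seq[i] for i in cols)
            for name, seq in alignment.items()}


def bootstrap(alignment: Dict[str, str], n_replicates: int,
              seed: int = 0) -> List[Dict[str, str]]:
    """Return n_replicates resampled alignments (seeded for repeatability)."""
    rng = random.Random(seed)
    return [bootstrap_replicate(alignment, rng) for _ in range(n_replicates)]
```

Each replicate would then be passed to a tree inference method, and the resulting trees summarized into support values. That part is left out here because it depends on the inference tool used.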