首页> 外文期刊>DNA research: an international journal for rapid publication of reports on genes and genomes >DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data
【24h】

DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data

机译:DDBJ读取注释管道:用于下一代测序数据高通量分析的基于云计算的管道

获取原文
           

摘要

High-performance next-generation sequencing (NGS) technologies are advancing genomics and molecular biological research. However, the immense amount of sequence data requires computational skills and suitable hardware resources that are a challenge to molecular biologists. The DNA Data Bank of Japan (DDBJ) of the National Institute of Genetics (NIG) has initiated a cloud computing-based analytical pipeline, the DDBJ Read Annotation Pipeline (DDBJ Pipeline), for a high-throughput annotation of NGS reads. The DDBJ Pipeline offers a user-friendly graphical web interface and processes massive NGS datasets using decentralized processing by NIG supercomputers currently free of charge. The proposed pipeline consists of two analysis components: basic analysis for reference genome mapping and de novo assembly and subsequent high-level analysis of structural and functional annotations. Users may smoothly switch between the two components in the pipeline, facilitating web-based operations on a supercomputer for high-throughput data analysis. Moreover, public NGS reads of the DDBJ Sequence Read Archive located on the same supercomputer can be imported into the pipeline through the input of only an accession number. This proposed pipeline will facilitate research by utilizing unified analytical workflows applied to the NGS data. The DDBJ Pipeline is accessible at http://p.ddbj.nig.ac.jp/.
机译:高性能下一代测序(NGS)技术正在推动基因组学和分子生物学研究。但是,大量的序列数据需要计算技能和合适的硬件资源,这对分子生物学家是一个挑战。国立遗传学研究所(NIG)的日本DNA数据库(DDBJ)已启动基于云计算的分析管道,即DDBJ读取注释管道(DDBJ管道),用于NGS读取的高通量注释。 DDBJ管道提供了用户友好的图形化Web界面,并通过NIG超级计算机的分散处理(目前免费)来处理大量NGS数据集。拟议中的流程包括两个分析组件:用于参考基因组图谱和从头组装的基本分析,以及随后对结构和功能注释的高级分析。用户可以在管道中的两个组件之间顺畅地切换,从而有助于在超级计算机上进行基于Web的操作以进行高通量数据分析。而且,位于同一超级计算机上的DDBJ序列读取档案的公共NGS读取可以通过仅输入登录号的方式导入到管道中。该拟议中的管道将利用应用于NGS数据的统一分析工作流程来促进研究。可通过http://p.ddbj.nig.ac.jp/访问DDBJ管道。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号