DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data

Eli Kaminuma; Hajime Ohyanagi; Hideaki Sugawara; Hideki Nagasaki; Kousaku Okubo; Nori Kurata; Satoshi Saruhashi; Shota Morizaki; Takako Mochizuki; Toshihisa Takagi; Yasukazu Nakamura; Yuichi Kodama

首页> 外文期刊>DNA research: an international journal for rapid publication of reports on genes and genomes >DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data

【24h】

DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data

机译：DDBJ读取注释管道：用于下一代测序数据高通量分析的基于云计算的管道

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

High-performance next-generation sequencing (NGS) technologies are advancing genomics and molecular biological research. However, the immense amount of sequence data requires computational skills and suitable hardware resources that are a challenge to molecular biologists. The DNA Data Bank of Japan (DDBJ) of the National Institute of Genetics (NIG) has initiated a cloud computing-based analytical pipeline, the DDBJ Read Annotation Pipeline (DDBJ Pipeline), for a high-throughput annotation of NGS reads. The DDBJ Pipeline offers a user-friendly graphical web interface and processes massive NGS datasets using decentralized processing by NIG supercomputers currently free of charge. The proposed pipeline consists of two analysis components: basic analysis for reference genome mapping and de novo assembly and subsequent high-level analysis of structural and functional annotations. Users may smoothly switch between the two components in the pipeline, facilitating web-based operations on a supercomputer for high-throughput data analysis. Moreover, public NGS reads of the DDBJ Sequence Read Archive located on the same supercomputer can be imported into the pipeline through the input of only an accession number. This proposed pipeline will facilitate research by utilizing unified analytical workflows applied to the NGS data. The DDBJ Pipeline is accessible at http://p.ddbj.nig.ac.jp/.

机译：高性能下一代测序（NGS）技术正在推动基因组学和分子生物学研究。但是，大量的序列数据需要计算技能和合适的硬件资源，这对分子生物学家是一个挑战。国立遗传学研究所（NIG）的日本DNA数据库（DDBJ）已启动基于云计算的分析管道，即DDBJ读取注释管道（DDBJ管道），用于NGS读取的高通量注释。 DDBJ管道提供了用户友好的图形化Web界面，并通过NIG超级计算机的分散处理（目前免费）来处理大量NGS数据集。拟议中的流程包括两个分析组件：用于参考基因组图谱和从头组装的基本分析，以及随后对结构和功能注释的高级分析。用户可以在管道中的两个组件之间顺畅地切换，从而有助于在超级计算机上进行基于Web的操作以进行高通量数据分析。而且，位于同一超级计算机上的DDBJ序列读取档案的公共NGS读取可以通过仅输入登录号的方式导入到管道中。该拟议中的管道将利用应用于NGS数据的统一分析工作流程来促进研究。可通过http://p.ddbj.nig.ac.jp/访问DDBJ管道。

著录项

来源
《DNA research: an international journal for rapid publication of reports on genes and genomes》 |2013年第4期|共8页
作者
Eli Kaminuma; Hajime Ohyanagi; Hideaki Sugawara; Hideki Nagasaki; Kousaku Okubo; Nori Kurata; Satoshi Saruhashi; Shota Morizaki; Takako Mochizuki; Toshihisa Takagi; Yasukazu Nakamura; Yuichi Kodama;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类生物化学;
关键词

相似文献

外文文献
中文文献
专利

1. DDBJ read annotation pipeline: A cloud computing-based pipeline for high-throughput analysis of next-generation sequencing data [J] . NagasakiH., MochizukiT., KodamaY., DNA research: an international journal for rapid publication of reports on genes and genomes . 2013,第4期

机译：DDBJ读取注释管道：基于云计算的管道，可对下一代测序数据进行高通量分析
2. Nucleotide-Level Variant Analysis of Next-Generation Sequencing Data Using a Cloud-Based Data Analysis Pipeline [J] . G. Asimenos, A. Sundquist Journal of biomolecular techniques :JBT. . 2011,第Suppl期

机译：使用基于云的数据分析管道对下一代测序数据进行核苷酸水平的变异分析
3. Variant Call Format-Diagnostic Annotation and Reporting Tool A Customizable Analysis Pipeline for Identification of Clinically Relevant Genetic Variants in Next-Generation Sequencing Data [J] . Benton Miles C., Smith Robert A., Haupt Larisa M., The Journal of molecular diagnostics: JMD . 2019,第6期

机译：变体呼叫格式 - 诊断注释和报告工具可定制的分析管道，用于识别下一代测序数据中的临床相关的遗传变量
4. A highly parallel next-generation DNA sequencing data analysis pipeline in Hadoop [C] . Aggour Kareem S., Kumar Vijay S., Sangurdekar Dipen P., IEEE International Conference on Bioinformatics and Biomedicine . 2015

机译：Hadoop中高度并行的下一代DNA测序数据分析管道
5. Development of SRADE tool and analysis of quality scores of the reads of Next-Generation Sequencing data. [D] . Kotha, Chaitanya Krishna. 2014

机译：开发SRADE工具并分析下一代测序数据读数的质量得分。
6. DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data [O] . Hideki Nagasaki, Takako Mochizuki, Yuichi Kodama, 2013

机译：DDBJ读取注释管道：基于云计算的管道用于下一代测序数据的高通量分析
7. Read Annotation Pipeline for High-Throughput Sequencing Data [O] . James Holt, Shunping Huang, Wei Wang, 2014

机译：读取注释流水线以获取高通量测序数据

DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data

摘要

著录项

相似文献

相关主题

期刊订阅