Bio-Docklets: virtualization containers for single-step execution of NGS pipelines

Baekdoo Kim; Thahmina Ali; Carlos Lijeron; Enis Afgan; Konstantinos Krampis

首页> 外文期刊>GigaScience >Bio-Docklets: virtualization containers for single-step execution of NGS pipelines

【24h】

Bio-Docklets: virtualization containers for single-step execution of NGS pipelines

机译：Bio-Docklets：用于一步执行NGS管道的虚拟化容器

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Processing of next-generation sequencing (NGS) data requires significant technical skills, involving installation, configuration, and execution of bioinformatics data pipelines, in addition to specialized postanalysis visualization and data mining software. In order to address some of these challenges, developers have leveraged virtualization containers toward seamless deployment of preconfigured bioinformatics software and pipelines on any computational platform. We present an approach for abstracting the complex data operations of multistep, bioinformatics pipelines for NGS data analysis. As examples, we have deployed 2 pipelines for RNA sequencing and chromatin immunoprecipitation sequencing, preconfigured within Docker virtualization containers we call Bio-Docklets. Each Bio-Docklet exposes a single data input and output endpoint and from a user perspective, running the pipelines as simply as running a single bioinformatics tool. This is achieved using a “meta-script” that automatically starts the Bio-Docklets and controls the pipeline execution through the BioBlend software library and the Galaxy Application Programming Interface. The pipeline output is postprocessed by integration with the Visual Omics Explorer framework, providing interactive data visualizations that users can access through a web browser. Our goal is to enable easy access to NGS data analysis pipelines for nonbioinformatics experts on any computing environment, whether a laboratory workstation, university computer cluster, or a cloud service provider. Beyond end users, the Bio-Docklets also enables developers to programmatically deploy and run a large number of pipeline instances for concurrent analysis of multiple datasets.

机译：下一代测序（NGS）数据的处理除了专门的分析后可视化和数据挖掘软件外，还需要重要的技术技能，包括生物信息学数据管道的安装，配置和执行。为了应对这些挑战中的某些挑战，开发人员已利用虚拟化容器在任何计算平台上无缝部署预配置的生物信息学软件和管道。我们提出了一种抽象方法，用于NGS数据分析的多步骤生物信息学管道的复杂数据操作。作为示例，我们已经部署了2条用于RNA测序和染色质免疫沉淀测序的管道，这些管道在Docker虚拟化容器（称为Bio-Docklets）中进行了预配置。每个Bio-Docklet都公开了一个数据输入和输出端点，并且从用户的角度出发，运行管道就像运行单个生物信息学工具一样简单。这可以通过“元脚本”来实现，该脚本可自动启动Bio-Docklets并通过BioBlend软件库和Galaxy应用程序编程接口控制管道执行。通过与Visual Omics Explorer框架集成来对管道输出进行后处理，从而提供用户可以通过Web浏览器访问的交互式数据可视化。我们的目标是使非生物信息学专家在任何计算环境（实验室工作站，大学计算机集群或云服务提供商）上均可轻松访问NGS数据分析管道。除了最终用户之外，Bio-Docklets还使开发人员能够以编程方式部署和运行大量管道实例，以便同时分析多个数据集。

著录项

来源
《GigaScience》 |2017年第8期|共7页
作者
Baekdoo Kim; Thahmina Ali; Carlos Lijeron; Enis Afgan; Konstantinos Krampis;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类医学与其他学科的关系;
关键词

相似文献

外文文献
中文文献
专利

1. Platform-Agnostic Deployment of Bioinformatics Pipelines for Clinical NGS Assays Using Containers, Infrastructure Orchestration, and Workflow Manager [J] . Kadri S., Roy S. The Journal of molecular diagnostics: JMD . 2019,第6期

机译：使用容器，基础设施编程和工作流管理器的临床NGS测定的平台 - 无障碍部署生物信息化管道
2. NG Advantage virtual pipeline reduces emissions [J] . Midstream business Group Midstream business . 2014,第6期

机译：NG Advantage虚拟管道可减少排放
3. A gearbox model for processing large volumes of data by using pipeline systems encapsulated into virtual containers [J] . Miguel Santiago-Duran, J.L. Gonzalez-Compean, Andre Brinkmann, Future generation computer systems . 2020,第May期

机译：一种通过使用封装在虚拟容器中的管道系统来处理大量数据的齿轮箱模型
4. Polymorphic Pipeline Array: A flexible multicore accelerator with virtualized execution for mobile multimedia applications [C] . Park Hyunchul, Park Yongjun, Mahlke Scott Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture . 2009

机译：多态管线阵列：一种灵活的多核加速器，具有针对移动多媒体应用的虚拟化执行
5. Secure and Trusted Execution Framework for Virtualized Workloads [D] . Kotikela, Srujan D. 2018

机译：虚拟工作负载的安全和受信任的执行框架
6. Bio-Docklets: virtualization containers for single-step execution of NGS pipelines [O] . Baekdoo Kim, Thahmina Ali, Carlos Lijeron, 2017

机译：Bio-Docklets：用于一步执行NGS管道的虚拟化容器
7. MỘT MÔ HÌNH ĐỀ XUẤT CHO BÀI TOÁN NHẬN DẠNG KÝ TỰ TRÊN CONTAINER VẬN TẢI ĐƯỜNG THỦY [O] . Lê Hoàng Thanh 2017

机译：水运容器上的特征识别问题的建议模型
8. Latvijas gaze. Report on project execution and finalization. Inspection of pipelines, phase 2. Sloka branch pipeline [R] . 1997

机译：Latvijas凝视。关于项目执行和最终确定的报告。管道检查，第2阶段.sloka支管道

Bio-Docklets: virtualization containers for single-step execution of NGS pipelines

摘要

著录项

相似文献

相关主题

期刊订阅