MULKSG: MULtiple K Simultaneous Graph Assembly

机译：mulksg：多k同步图组装

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This work shows how to parallelize multi K de Bruijn graph genome assembly simultaneously, removing the bottleneck of iterative multi K assembly. The expected execution time on a single node with 40 cores is variable, with the average execution time for the entire pipeline over 16 datasets tested being 1613 s for SPAdes vs. 1581 s for MULKSG, with the MULKSG graph creation and traversal averaging 15% faster than SPAdes. We implement a multi-node implementation for the graph creation and traversal portions of the assembly, showing the speedups in Fig. 4. We show that when implemented correctly with correction phases performed per graph in parallel, the expected outcome is very close to the original method, in some cases having less errors while keeping the same NGA50 and genome coverage %. We show this works in practice, implementing with the popular genome assembler SPAdes. Further, this algorithmic change gets rid of the single node sequential bottleneck on multi K genome assembly, allowing for the use of parallel error correction, graph building, graph correction, and graph traversal. We implement a parallel version of the assembly and show the statistics are the same as when run on a single node. The code is open source and can be found at https://github.com/cwright7101/mulksg.

机译：这项工作显示了如何同时并行化多k de Bruijn曲线图基因组组件，从而消除迭代多K组件的瓶颈。具有40个核心的单个节点上的预期执行时间是可变的，具有超过16个数据集的整个流水链的平均执行时间，用于Mulksg的Spades与SPADES与SPADS与1581s）进行1613秒，Mulksg图创建和遍历平均速度较快15％而不是黑桃。我们为组装的图形创建和遍历部分实现了多节点实现，示出了图4中的加速。我们表明，当用每格并行执行的校正阶段正确实现时，预期结果非常接近原始结果方法，在某些情况下具有较少的误差，同时保持相同的NGA50和基因组覆盖率％。我们在实践中展示了这项工作，利用流行的基因组汇编器黑桃实施。此外，该算法变化将在多k基因组组件上摆脱单节点顺序瓶颈，允许使用并行误差校正，图形构建，图形校正和图形遍历。我们实现了程序集的并行版本，并显示统计信息与在单个节点上运行时的统计信息相同。代码是开源，可以在https://github.com/cwright7101/mulksg找到。

著录项

来源
《International Conference on Algorithms for Computational Biology》|2019年|223p|共12页
会议地点
作者
Christopher Wright; Sriram Krishnamoorty; Milind Kulkarni;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 Q811-53;
关键词
Genome assembly; Iterative assembly; Parallel de bruijn graph; Multi k assembly;

机译：基因组组装;迭代大会;平行de bruijn图;多K组件;

相似文献

外文文献
中文文献
专利

1. Target Assembly to Check Boresight Alignment of Active Sensors This assembly can simultaneously measure the co-boresite alignment of multiple transmitter laser beams and receiver channels. [J] . Goddard Space NASA Tech Briefs . 2011,第4期

机译：用于检查有源传感器的视轴对准的目标组件该组件可以同时测量多个发射器激光束和接收器通道的共硼铁矿对准。
2. Disassembly sequence structure graphs: An optimal approach for multiple-target selective disassembly sequence planning [J] . Shana Smith, Greg Smith, Wei-Han Chen Advanced engineering informatics . 2012,第2期

机译：拆卸序列结构图：多目标选择性拆卸序列计划的最佳方法
3. Multiple, simultaneous, independent gradients for a versatile multidimensional liquid chromatography. Part II: Application 1-Large increases in isoform resolution of human transferrin by use of dual simultaneous independent gradients of pH & acetonitrile on a mixed bed (anion exchange plus reversed phase) stationary phase [J] . Tsonev Latchezar I., Hirsh Allen G. Journal of chromatography, A: Including electrophoresis and other separation methods . 2016,第Null期

机译：适用于多维多维液相色谱的多个同时发生的独立梯度。第二部分：应用1-通过在混合床（阴离子交换加反相）固定相上同时使用pH和乙腈的双重同时独立梯度，大幅度提高人转铁蛋白的同工型分辨率
4. MULKSG: MULtiple K Simultaneous Graph Assembly [C] . Christopher Wright, Sriram Krishnamoorty, Milind Kulkarni International conference on algorithms for computational biology . 2019

机译：MULKSG：MULtiple K同时图装配
5. Use of Intramuscular Electromyography for the Simultaneous Control of Multiple Degrees of Freedom in Upper-Limb Myoelectric Prostheses [D] . Smith, Lauren Hart 2015

机译：使用肌内肌电图同时控制上肢肌电假体的多个自由度
6. A Simplified and Versatile System for the Simultaneous Expression of Multiple siRNAs in Mammalian Cells Using Gibson DNA Assembly [O] . Fang Deng, Xiang Chen, Zhan Liao, -1

机译：使用吉布森DNA组装的哺乳动物细胞中多个siRNA同时表达的简化和多功能系统。
7. Rapid and Efficient Synthetic Assembly of Multiplex Luciferase Reporter Plasmids for the Simultaneous Monitoring of Up to Six Cellular Signaling Pathways [O] . Alejandro Sarrion‐Perdigones, Yezabel Gonzalez, Koen J.T. Venken 2020

机译：多重荧光素酶报告组质粒的快速高效合成组装，用于同时监测多达六个蜂窝信号通路
8. Simultaneous Excitation of Multiple-Input Multiple-Output CFD-Based Unsteady Aerodynamic Systems [R] . Silva, Walter A. 2008

机译：基于多输入多输出CFD的非定常气动系统的同时激励

MULKSG: MULtiple K Simultaneous Graph Assembly

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅