首页> 外文期刊>BMC Bioinformatics >IonCRAM: a reference-based compression tool for ion torrent sequence files
【24h】

IonCRAM: a reference-based compression tool for ion torrent sequence files

机译:Ioncram:用于离子洪流序列文件的基于参考压缩工具

获取原文
           

摘要

Ion Torrent is one of the major next generation sequencing (NGS) technologies and it is frequently used in medical research and diagnosis. The built-in software for the Ion Torrent sequencing machines delivers the sequencing results in the BAM format. In addition to the usual SAM/BAM fields, the Ion Torrent BAM file includes technology-specific flow signal data. The flow signals occupy a big portion of the BAM file (about 75% for the human genome). Compressing SAM/BAM into CRAM format significantly reduces the space needed to store the NGS results. However, the tools for generating the CRAM formats are not designed to handle the flow signals. This missing feature has motivated us to develop a new program to improve the compression of the Ion Torrent files for long term archiving. In this paper, we present IonCRAM, the first reference-based compression tool to compress Ion Torrent BAM files for long term archiving. For the BAM files, IonCRAM could achieve a space saving of about 43%. This space saving is superior to what achieved with the CRAM format by about 8–9%. Reducing the space consumption of NGS data reduces the cost of storage and data transfer. Therefore, developing efficient compression software for clinical NGS data goes beyond the computational interest; as it ultimately contributes to the overall cost reduction of the clinical test. The space saving achieved by our tool is a practical step in this direction. The tool is open source and available at Code Ocean, github, and http://ioncram.saudigenomeproject.com .
机译:离子Torrent是下一个主要的下一代测序(NGS)技术之一,它经常用于医学研究和诊断。用于离子流序机的内置软件可提供BAM格式的序列结果。除了通常的SAM / BAM字段之外,离子Torrent BAM文件还包括特定于技术的流量信号数据。流量信号占据BAM文件的大部分(人类基因组的约75%)。将SAM / BAM压缩成CRAM格式,显着减少了存储NGS结果所需的空间。然而,用于产生CRAM格式的工具不设计用于处理流量信号。此缺失功能有动力开发新程序,以改善离子归档的离子洪流文件的压缩。在本文中,我们呈现IOONCRAM,第一个基于参考的压缩工具,用于压缩离子粗糙BAM文件,用于长期归档。对于BAM文件,IONCRAM可以实现节省约43%的空间。这种空间节省优于CRAM格式达到约8-9%的原因。降低NGS数据的空间消耗降低了存储和数据传输的成本。因此,开发用于临床NGS数据的有效压缩软件超出了计算兴趣;因为它最终有助于临床测试的总体成本降低。我们工具实现的节省空间是朝这个方向的实际步骤。该工具是开源的,在代码海洋,Github和http://ioncram.saudigenomeProject.com上提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号