首页> 美国卫生研究院文献>Bioinformation >Compression of Large genomic datasets using COMRAD on Parallel Computing Platform

【2h】

Compression of Large genomic datasets using COMRAD on Parallel Computing Platform

机译：在并行计算平台上使用COMRAD压缩大型基因组数据集

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

The big data storage is a challenge in a post genome era. Hence, there is a need for high performance computing solutions for managing large genomic data. Therefore, it is of interest to describe a parallel-computing approach using message-passing library for distributing the different compression stages in clusters. The genomic compression helps to reduce the on disk“foot print” of large data volumes of sequences. This supports the computational infrastructure for a more efficient archiving. The approach was shown to find utility in 21 Eukaryotic genomes using stratified sampling in this report. The method achieves an average of 6-fold disk space reduction with three times better compression time than COMRAD.AvailabilityThe source codes are written in C using message passing libraries and are available at

机译：在后基因组时代，大数据存储是一个挑战。因此，需要用于管理大型基因组数据的高性能计算解决方案。因此，有必要描述一种使用消息传递库的并行计算方法，以在群集中分布不同的压缩阶段。基因组压缩有助于减少大序列数据量在磁盘上的“占用空间”。这支持了计算基础架构，以实现更有效的归档。在本报告中，使用分层采样显示该方法可在21个真核生物基因组中找到效用。该方法平均可减少磁盘空间6倍，压缩时间是COMRAD的三倍。可用性源代码使用C语言编写，使用消息传递库，并且可以在以下位置获得

著录项

期刊名称 Bioinformation
作者
Christopher Leela Biji; Manu K Madhu; Vineetha Vishnu; Satheesh Kumar K; Vijayakumar; Achuthsankar S Nair;
展开▼
作者单位

展开▼
年(卷),期 2015(11),5
年度 2015
页码 267–271
总页数 5
原文格式 PDF
正文语种
中图分类生物学;
关键词
Genome compression Sequence analysis Parallel Computing Big data storage Genome Analysis;

机译：基因组压缩;序列分析;并行计算;大数据存储;基因组分析;

相似文献

外文文献
中文文献
专利

1. Integrating multi-platform genomic datasets for kidney renal clear cell carcinoma subtyping using stacked denoising autoencoders [J] . Tongjun Gu, Xiwu Zhao Scientific reports. . 2019,第1期

机译：使用堆叠去噪自动化器整合用于肾肾透明细胞癌亚型的多平台基因组数据集
2. Author Correction: Integrating multi-platform genomic datasets for kidney renal clear cell carcinoma subtyping using stacked denoising autoencoders [J] . Tongjun Gu, Xiwu Zhao Scientific reports. . 2019,第1期

机译：作者校正：使用堆叠的去噪自身叠层集成肾肾透明细胞癌亚型的多平台基因组数据集
3. lncRNA-screen: an interactive platform for computationally screening long non-coding RNAs in large genomics datasets [J] . Yixiao Gong, Hsuan-Ting Huang, Yu Liang, BMC Genomics . 2017,第1期

机译：lncRNA-screen：用于在大型基因组数据集中计算筛选长的非编码RNA的交互式平台
4. Improving Bioinformatics Analysis of Large Sequence Datasets Parallelizing Tools for Population Genomics [C] . Javier Navarro, Gonzalo Vera, Sebastian Ramos-Onsins, International Conference on Parallel and Distributed Computing . 2017

机译：改善大序列数据集的生物信息学分析，对群体基因组学的平行化工具
5. Advancement of computing on large datasets via parallel computing and cyberinfrastructure [D] . Yildirim, Ahmet Artu 2015

机译：通过并行计算和网络基础设施对大型数据集进行计算的进展
6. Parallel comparison of Illumina RNA-Seq and Affymetrix microarray platforms on transcriptomic profiles generated from 5-aza-deoxy-cytidine treated HT-29 colon cancer cells and simulated datasets [O] . Xiao Xu, Yuanhao Zhang, Jennie Williams, 2013

机译：Illumina RNA-Seq和Affymetrix微阵列平台在5-氮杂-脱氧胞苷处理的HT-29结肠癌细胞和模拟数据集生成的转录组谱上的平行比较
7. GPU Accelerated Fractal Image Compression for Medical Imaging in Parallel Computing Platform [O] . Haque, Md. Enamul, Kaisan, Abdullah Al, Saniat, Mahmudur R, 2014

机译：用于医学成像的GpU加速分形图像压缩并行计算平台

Compression of Large genomic datasets using COMRAD on Parallel Computing Platform

摘要

著录项

相似文献

相关主题

期刊订阅