首页> 外文会议>Australian National Health Informatics Conference >SRA Down Under: Cache and Analysis Platform for Infectious Disease
【24h】

SRA Down Under: Cache and Analysis Platform for Infectious Disease

机译:SRA Down:传染病缓存和分析平台

获取原文

摘要

SRA, NCBI's Sequence Read Archive, is a valuable resource holding a near definitive collection of the world's collective sequenced reads for academic purposes. Increasingly, these reads are being used for both basic research and clinical investigations. When time is a critical factor in analysis, such as during bacterial outbreaks, the geographical separation between Australia and the offshore NCBI SRA servers can result in significant delays that may have adverse clinical outcomes. To address this, Queensland Genomics commissioned a pilot program for the establishment of a local Australian SRA Cache. Utilizing the hosting capabilities of the NeCTAR Research Cloud, QRIScloud's HTC infrastructure and the MeDiCI data fabric as a storage solution, and the software stack of Cromwell for workflow management, PostgreSQL database for sample and job metadata, and a coordinator Python Flask application, a local cache of seventeen bacterial species was established. Furthermore, the workflow capabilities of Cromwell were leveraged to provide analysis solutions for cached sample data, including quality control and taxonomic profiling, and individual and multiple sample analysis. Moving forward to a broader rollout of increased bacterial species, it was found that the initial storage estimation did not keep up with the exponential increase sequencing reads uploaded to NCBI SRA, which while highlighting the increasing availability and importance in modem research, will need to be addressed.
机译:SRA,NCBI的序列读取档案,是一个有价值的资源,持有世界上集体测序读取的近乎定义的学术目的。这些读数越来越多地用于基础研究和临床调查。当时间是分析中的关键因素时,例如在细菌爆发期间,澳大利亚和海上NCBI SRA服务器之间的地理分离可能导致可能具有不利临床结果的显着延迟。为了解决这个问题,昆士兰州基因组学委托了一个试点计划,以建立当地的澳大利亚SRA缓存。利用花蜜研究云的托管能力,QRISCLOUD的HTC基础设施和Medici数据结构作为存储解决方案,以及用于样品和作业元数据的PostgreSQL数据库的Cromwell软件堆栈,以及协调器Python烧瓶应用程序,即本地建立了十七种细菌物种的缓存。此外,克罗姆威尔的工作流程能力被利用,为缓存的样本数据提供分析解决方案,包括质量控制和分类分析,以及个人和多个样本分析。前进到更广泛的细菌种类的卷展栏,发现初始存储估计没有跟上上传到NCBI SRA的指数增加测序读取,这在突出越来越多的可用性和调制解调器研究中的重要性,需要是解决。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号