首页> 外文会议> >Symposium Evening Tutorial: High-performance Computing Methods for Computational Genomics
【24h】

Symposium Evening Tutorial: High-performance Computing Methods for Computational Genomics

机译:专题研讨会晚会:计算基因组学的高性能计算方法

获取原文

摘要

As biomolecular sequence data continue to be amassed at unprecedented rates, the design of effective computational methods and capabilities that can derive biologically significant information from them has become both increasingly challenging and imperative. In this tutorial, the audience will be first introduced to the different types of biomolecular sequence data and the wealth of information they encode. Following this technical grounding, high-performance computing approaches developed to address some of the most computationally challenging problems in genomics will be described. The contents will be presented in three parts: (i) In the first part, we will describe methods that were designed to query a sequence against a large sequence database. Two popular parallel approaches, mpiBLAST and ScalaBLAST, implementing the NCBI BLAST suite of programs will be described. (ii) Next, we will describe PaCE, which is a parallel DNA sequence clustering algorithm. As direct applications, we will discuss the clustering of large-scale Expressed Sequence Tag data and the assembly of complex genomes. (iii) Finally, we describe GRAPPA, which is a high-performance software suite developed for phylogenetic reconstruction of a collection of genomes or genes. Throughout the tutorial, emphasis will be on both scalability and effectiveness in exploiting large-scale state-of-the-art supercomputing technologies. The intended audience are academic and industry researchers, educators, and/or commercial application developers, with a computational background. No background in biology is assumed.
机译:随着生物分子序列数据以前所未有的速度不断积累,设计有效的计算方法和功能以从中获取生物学上重要的信息变得越来越具有挑战性,也势在必行。在本教程中,将首先向观众介绍生物分子序列数据的不同类型及其所编码的大量信息。在此技术基础上,将描述为解决基因组学中一些最具计算挑战性问题而开发的高性能计算方法。内容将分为三个部分:(i)在第一部分中,我们将描述旨在针对大型序列数据库查询序列的方法。将描述实现NCBI BLAST程序套件的两种流行的并行方法,mpiBLAST和ScalaBLAST。 (ii)接下来,我们将描述PaCE,这是一种并行的DNA序列聚类算法。作为直接应用,我们将讨论大规模表达序列标签数据的聚类和复杂基因组的组装。 (iii)最后,我们介绍GRAPPA,这是一个高性能的软件套件,用于系统地重建一组基因组或基因。在整个教程中,重点将放在开发大规模最新的超级计算技术上的可伸缩性和有效性上。目标受众是具有计算背景的学术和行业研究人员,教育工作者和/或商业应用程序开发人员。没有生物学背景。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号