High-throughput computation of pairwise sequence similarities for multiple genome comparisons using ScalaBLAST

机译：使用Scalablast的多个基因组比较的成对序列相似性的高吞吐量计算

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Genome sequence comparisons of exponentially growing data sets form the foundation for the comparative analysis tools provided by community biological data resources such as the Integrated Microbial Genome (IMG) system at the Joint Genome Institute (JGI). For a genome sequencing center to provide multiple-genome comparison capabilities, it must keep pace with exponentially growing collection of sequence data, both from its own genomes, and from public genomes. We present an example of how ScalaBLAST, a high-throughput sequence analysis program, harnesses increasingly critical high-performance computing to perform sequence analysis, enabling, for example, all vs. all BLAST runs across 2 million protein sequences within a day using thousands of processors as opposed to conventional comparison methods that would take years to complete.

机译：基因组序列比较指数越来越多的数据集形成了由社区生物数据资源如联合基因组研究所（JGI）的集成微生物基因组（IMG）系统提供的对比分析工具的基础。对于基因组测序中心提供多基因组比较能力，必须与其自身基因组和公共基因组的序列数据的序列数据集合一致。我们展示了缩放，高吞吐量序列分析程序，利用越来越关键的高性能计算的示例，以执行序列分析，例如，所有与所有爆炸在使用成千上万的一天内跨越200万蛋白序列的爆炸运行。处理器与传统比较方法相反，需要多年来完成。

著录项

来源
《IEEE/NIH Life Science Systems and Applications Workshop》|2007年||共3页
会议地点
作者
Anuj R. Shah; Victor M. Markowitz; Christopher S. Oehmen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词

相似文献

外文文献
中文文献
专利

1. Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods [J] . Park J., Barrett C., Hughey R., Journal of Molecular Biology . 1998,第4期

机译：使用多个序列进行的序列比较检测到的远程同源物是成对方法的三倍
2. PipTools: A Computational Toolkit to Annotate and Analyze Pairwise Comparisons of Genomic Sequences. [J] . Elnitski L, Riemer C, Petrykowska H, Genomics . 2002,第6期

机译：PipTools：一种用于注释和分析基因组序列的成对比较的计算工具包。
3. Breaking the computational barriers of pairwise genome comparison [J] . Oscar Torreno, Oswaldo Trelles BMC Bioinformatics . 2015,第1期

机译：打破成对基因组比较的计算障碍
4. High-throughput computation of pairwise sequence similarities for multiple genome comparisons using ScalaBLAST [C] . Anuj R. Shah, Victor M. Markowitz, Christopher S. Oehmen IEEE/NIH Life Science Systems and Applications Workshop . 2007

机译：使用Scalablast的多个基因组比较的成对序列相似性的高吞吐量计算
5. Computational tools for the analysis of high-throughput genome-scale sequence data [D] . Lopez, David Adrian, Jr. 2016

机译：用于分析高通量基因组规模序列数据的计算工具
6. KISSa: a strategy to build multiple sequence alignments from pairwise comparisons of very closely related sequences [O] . Francesco Marass, Chris Upton 2009

机译：KISSa：一种从非常紧密相关的序列的成对比较中建立多个序列比对的策略
7. KISSa: a strategy to build multiple sequence alignments from pairwise comparisons of very closely related sequences [O] . 2009

机译：KISSa：一种从非常紧密相关的序列的成对比较中建立多个序列比对的策略

High-throughput computation of pairwise sequence similarities for multiple genome comparisons using ScalaBLAST

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅