An energy-aware bioinformatics application for assembling short reads in high performance computing systems

机译：一种能源敏感型生物信息学应用程序，用于在高性能计算系统中组装短读

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Current biomedical technologies are producing massive amounts of data on an unprecedented scale. The increasing complexity and growth rate of biological data has made bioinformatics data processing and analysis a key and computationally intensive task. High performance computing (HPC) has been successfully applied to major bioinformatics applications to reduce computational burden. However, a naïve approach for developing parallel bioinformatics applications may achieve a high degree of parallelism while unnecessarily expending computational resources and consuming high levels of energy. As the wealth of biological data and associated computational burden continues to increase, there has become a need for the development of energy efficient computational approaches in the bioinformatics domain. To address this issue, we have developed an energy-aware scheduling (EAS) model to run computationally intensive applications that takes both deadline requirements and energy factors into consideration. An example of a computationally demanding process that would benefit from our scheduling model is the assembly of short sequencing reads produced by next generation sequencing technologies. Next generation sequencing produces a very large number of short DNA reads from a biological sample. Multiple overlapping fragments must be aligned and merged into long stretches of contiguous sequence before any useful information can be gathered. The assembly problem is extremely difficult due to the complex nature of underlying genome structure and inherent biological error present in current sequencing technologies. We apply our EAS model to a newly proposed assembly algorithm called Merge and Traverse, giving us the ability to generate speedup profiles. Our EAS model was also able to dynamically adjust the number of nodes needed to meet given deadlines for different sets of reads.

机译：当前的生物医学技术正在以前所未有的规模产生大量数据。生物数据的复杂性和增长率不断提高，已使生物信息学数据处理和分析成为一项关键且计算量大的任务。高性能计算（HPC）已成功应用于主要的生物信息学应用程序，以减少计算负担。但是，开发并行生物信息学应用程序的幼稚方法可以实现高度并行性，同时不必要地消耗计算资源并消耗大量能量。随着生物数据的财富和相关的计算负担继续增加，已经需要在生物信息学领域中开发节能的计算方法。为了解决此问题，我们开发了一种能源感知调度（EAS）模型来运行计算密集型应用程序，该模型同时考虑了期限要求和能源因素。将从我们的调度模型中受益的对计算要求很高的过程的一个示例是由下一代测序技术产生的短测序读段的组装。下一代测序可从生物样品中产生大量的短DNA读数。必须先将多个重叠的片段对齐并合并成较长的连续序列，然后才能收集任何有用的信息。由于基础基因组结构的复杂性和当前测序技术中存在的固有生物学错误，组装问题非常困难。我们将EAS模型应用于新提出的称为Merge and Traverse的组装算法，从而使我们能够生成加速曲线。我们的EAS模型还能够动态调整为满足不同读取集的给定期限而需要的节点数量。

著录项

来源
《2012 International Conference on High Performance Computing amp; Simulation》|2012年|p.154- 160|共7页
会议地点 Madrid(ES)
作者
Warnke Julia; Pawaskar Sachin; Ali Hesham;
展开▼
作者单位

College of Information Science and Technology, University of Nebraska at Omaha, Omaha, Nebraska 68182;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类一般性问题;自动模拟理论（自动仿真理论）;
关键词

相似文献

外文文献
中文文献
专利

1. Energy-aware clustering scheduling of parallel applications on heterogeneous computing systems [J] . Kaur Nirmal, Bhinder Raman Multiagent and grid systems . 2019,第1期

机译：异构计算系统上并行应用程序的能源感知集群调度
2. The Genome Sequencer FLX(TM) System-Longer reads, more applications, straight forward bioinformatics and more complete data sets [J] . Droege M, Hill B Journal of Biotechnology . 2008,第1a2期

机译：Genome Sequencer FLX（TM）系统读者阅读，更多应用，直接的生物信息学和更完整的数据集
3. Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance [J] . Barbara Feldmeyer, Christopher W Wheat, Nicolas Krezdorn, BMC Genomics . 2011,第1期

机译：简短阅读非模型蜗牛物种转录组（基数Balthica，Basommatophora，Pulmonata）从头组装的Illumina数据，并比较组装性能
4. An energy-aware bioinformatics application for assembling short reads in high performance computing systems [C] . Warnke Julia, Pawaskar Sachin, Ali Hesham International Conference on High Performance Computing and Simulation . 2012

机译：高性能计算系统中的用于组装短读取的能量感知生物信息学应用
5. Fault-tolerant techniques for high performance computing and a bioinformatics application. [D] . Walters, John Paul N. 2007

机译：高性能计算和生物信息学应用程序的容错技术。
6. Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica Basommatophora Pulmonata) and a comparison of assembler performance [O] . Barbara Feldmeyer, Christopher W Wheat, Nicolas Krezdorn, 2011

机译：简短阅读非模型蜗牛物种转录组（基数bal藜BasommatophoraPulmonata）从头组装的Illumina数据并比较组装机性能
7. Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance [O] . Feldmeyer Barbara, Wheat Christopher W., Krezdorn Nicolas, 2011

机译：简短阅读非模型蜗牛物种转录组（基数bal藜，Basommatophora，Pulmonata）从头组装的Illumina数据，并比较组装机性能

An energy-aware bioinformatics application for assembling short reads in high performance computing systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅