Genome-Scale Computational Approaches to Memory-Intensive Applications in Systems Biology

机译：基因组规模的计算方法，用于系统生物学中的内存密集型应用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Graph-theoretical approaches to biological network analysis have proven to be effective for small networks but are computationally infeasible for comprehensive genome-scale systems-level elucidation of these networks. The difficulty lies in the NP-hard nature of many global systems biology problems that, in practice, translates to exponential (or worse) run times for finding exact optimal solutions. Moreover, these problems, especially those of an enumerative flavor, are often memory-intensive and must share very large sets of data effectively across many processors. For example, the enumeration of maximal cliques - a core component in gene expression networks analysis, cis regulatory motif finding, and the study of quantitative trait loci for high-throughput molecular phenotypes can result in as many as 3^n/3 maximal cliques for a graph with n vertices. Memory requirements to store those cliques reach terabyte scales even on modest-sized genomes. Emerging hardware architectures with ultra-large globally addressable memory such as the SGI Altix and Cray X1 seem to be well suited for addressing these types of data-intensive problems in systems biology. This paper presents a novel framework that provides exact, parallel and scalable solutions to various graph-theoretical approaches to genome-scale elucidation of biological networks. This framework takes advantage of these large-memory architectures by creating globally addressable bitmap memory indices with potentially high compression rates, fast bitwise-logical operations, and reduced search space. Augmented with recent theoretical advancements based on fixed-parameter tractability, this framework produces computationally feasible performance for genome-scale combinatorial problems of systems biology.

机译：基于图论的生物网络分析方法已被证明对小型网络有效，但对于这些网络的全面基因组规模的系统级解释在计算上是不可行的。困难在于许多全球系统生物学问题的NP难性，在实践中，这些问题转化为指数（或更差的）运行时间以寻找精确的最佳解决方案。而且，这些问题，尤其是枚举问题，通常占用大量内存，并且必须在许多处理器之间有效地共享非常大的数据集。例如，最大集团的枚举-基因表达网络分析，顺式调控基序发现以及高通量分子表型的数量性状基因座研究的核心组成部分，可导致多达3 ^ n / 3的最大集团具有n个顶点的图。即使在中等大小的基因组上，存储这些团的内存需求也达到了TB级。具有超大全局可寻址内存的新兴硬件体系结构，例如SGI Altix和Cray X1，似乎非常适合解决系统生物学中的这类数据密集型问题。本文提出了一个新颖的框架，该框架为各种图论方法提供了精确，并行和可扩展的解决方案，以阐明生物网络的基因组规模。该框架通过创建全局可寻址的位图内存索引来利用这些大内存体系结构，这些索引可能具有较高的压缩率，快速的按位逻辑运算和减少的搜索空间。借助基于固定参数可扩展性的最新理论进展，此框架为系统生物学的基因组规模组合问题提供了计算上可行的性能。

著录项

来源
《Supercomputing, 2005. Proceedings of the ACM/IEEE SC 2005 Conference》|2005年|P.12|共1页
会议地点
作者
Yun Zhang; Abu-Khzam F.N.; Baldwin N.E.; Chesler E.J.; Langston M.A.; Samatova N.F.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词

相似文献

外文文献
中文文献
专利

1. Computational approaches to metabolic engineering utilizing systems biology and synthetic biology [J] . Stephen S. Fong Computational and Structural Biotechnology Journal . 2014,第18期

机译：利用系统生物学和合成生物学的代谢工程计算方法
2. Combined Computational Systems Biology and Computational Neuroscience Approaches Help Develop of Future “Cognitive Developmental Robotics” [J] . Faramarz Faghihi, Ahmed A. Moustafa Frontiers in Neurorobotics . 2017,第4期

机译：计算系统生物学和计算神经科学相结合的方法有助于发展未来的“认知发展机器人”
3. Algorithms in Computational Molecular Biology: Techniques, Approaches and Applications [J] . Energy Business Journal . 2011,第jana3aocta31期

机译：计算分子生物学中的算法：技术，方法和应用
4. Genome-Scale Computational Approaches to Memory-Intensive Applications in Systems Biology [C] . Yun Zhang, Faisal N. Abu-Khzam, Nicole E. Baldwin, ACM/IEEE conference on Supercomputing . 2005

机译：基因组规模的计算方法，用于系统生物学中的内存密集型应用
5. Computational approaches to systems biology: Applications in xenobiotic metabolism and cellular signaling. [D] . Finley, Stacey Deleria. 2009

机译：系统生物学的计算方法：在异生物代谢和细胞信号传导中的应用。
6. Research Topic: From structural to molecular systems biology: experimental and computational approaches to unravel mechanisms of kinase activity regulation in cancer and neurodegeneration: Hepatocellular carcinoma: a systems biology perspective [O] . Lorenza A. DAlessandro, René Meyer, Ursula Klingmüller 2013

机译：研究主题：从结构生物学到分子系统生物学：揭示癌症和神经变性中激酶活性调节机制的实验和计算方法：肝细胞癌：系统生物学的观点
7. Genome-scale computational approaches to memory-intensive applications in systems biology [O] . Abu-Khzam F.N., Zhang Yun, Baldwin N.E., 2017

机译：系统生物学中内存密集型应用程序的基因组规模计算方法

Genome-Scale Computational Approaches to Memory-Intensive Applications in Systems Biology

摘要

著录项

相似文献

相关主题

期刊订阅