【24h】

A Distributed System for Genetic Linkage Analysis

机译:分布式遗传连锁分析系统

获取原文
获取原文并翻译 | 示例

摘要

Linkage analysis is a tool used by geneticists for mapping disease-susceptibility genes in the study of Mendelian and complex diseases. However analyses of large inbred pedigrees with extensive missing data are often beyond the capabilities of a single computer. We present a distributed system called SUPERLINK-ONLINE for computing multipoint LOD scores of large inbred pedigrees. It achieves high performance via efficient parallelization of the algorithms in SUPERLINK, a state-of-the-art serial program for these tasks, and through utilization of thousands of resources residing in multiple opportunistic grid environments. Notably, the system is available online, which allows computationally intensive analyses to be performed with no need for either installation of software, or maintenance of a complicated distributed environment. The main algorithmic challenges have been to efficiently split large tasks for distributed execution in a highly dynamic non-dedicated running environment, as well as to utilize resources in all the available grid environments. Meeting these challenges has provided nearly interactive response time for shorter tasks while simultaneously serving massively parallel ones. The system, which is being used extensively by medical centers worldwide, achieves speedups of up to three orders of magnitude and allows analyses that were previously infeasible.
机译:连锁分析是遗传学家在孟德尔和复杂疾病研究中用于绘制疾病易感基因图的工具。但是,对于大型近亲谱系的分析却缺少大量数据,这通常超出了单台计算机的能力范围。我们提出了一个分布式系统SUPERLINK-ONLINE,用于计算大型近交系谱的多点LOD分数。它通过高效并行化SUPERLINK中的算法(用于这些任务的最先进的串行程序)以及利用位于多个机会网格环境中的数千种资源来实现高性能。值得注意的是,该系统可在线使用,从而无需安装软件或维护复杂的分布式环境即可执行计算密集型分析。主要的算法挑战是如何在高度动态的非专用运行环境中有效地拆分大型任务以进行分布式执行,以及在所有可用网格环境中利用资源。应对这些挑战为较短的任务提供了几乎交互式的响应时间,同时为大规模并行任务提供服务。该系统已被世界各地的医疗中心广泛使用,可将速度提高多达三个数量级,并可以进行以前不可行的分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号