by exploiting the fine-grained parallelism and superior hardware performance on the parallel computing platform for speeding up compute-intensive calculations,by using in-memory data structures on the parallel computing platform to cache data sets between a sequence of time-lagged queries on the same data, so that these queries can be processed without further data transfer overheads,by replicating data within the parallel computing platform so that multiple independent queries on the same target data set can be simultaneously processed using independent parallel partitions of the high-performance computing platform.;A specific embodiment of this invention was used for deploying a bio-informatics application involving gene and protein sequence matching using the Smith-Waterman algorithm on a database system connected via an Ethernet local area network to a parallel supercomputer."/> SYSTEM AND METHOD FOR EXECUTING COMPUTE-INTENSIVE DATABASE USER-DEFINED PROGRAMS ON AN ATTACHED HIGH-PERFORMANCE PARALLEL COMPUTER
首页> 外国专利> SYSTEM AND METHOD FOR EXECUTING COMPUTE-INTENSIVE DATABASE USER-DEFINED PROGRAMS ON AN ATTACHED HIGH-PERFORMANCE PARALLEL COMPUTER

SYSTEM AND METHOD FOR EXECUTING COMPUTE-INTENSIVE DATABASE USER-DEFINED PROGRAMS ON AN ATTACHED HIGH-PERFORMANCE PARALLEL COMPUTER

机译:在连接的高性能并行计算机上执行计算密集型数据库用户定义程序的系统和方法

摘要

The invention pertains to a system and method for dispatching and executing the compute-intensive parts of the workflow for database queries on an attached high-performance, parallel computing platform. The performance overhead for moving the required data and results between the database platform and the high-performance computing platform where the workload is executed is amortized in several ways, for example,by exploiting the fine-grained parallelism and superior hardware performance on the parallel computing platform for speeding up compute-intensive calculations,by using in-memory data structures on the parallel computing platform to cache data sets between a sequence of time-lagged queries on the same data, so that these queries can be processed without further data transfer overheads,by replicating data within the parallel computing platform so that multiple independent queries on the same target data set can be simultaneously processed using independent parallel partitions of the high-performance computing platform.;A specific embodiment of this invention was used for deploying a bio-informatics application involving gene and protein sequence matching using the Smith-Waterman algorithm on a database system connected via an Ethernet local area network to a parallel supercomputer.
机译:本发明涉及一种系统和方法,该系统和方法用于在附加的高性能并行计算平台上分配和执行工作流的计算密集型部分以进行数据库查询。在数据库平台和执行工作负载的高性能计算平台之间移动所需数据和结果的性能开销可以通过几种方式摊销,例如, 通过利用并行计算平台上的细粒度并行性和出色的硬件性能来加快计算密集型计算, by使用并行计算平台上的内存中数据结构在同一数据的一系列时间滞后查询之间缓存数据集,以便可以处理这些查询而无需进一步的数据传输开销, ,通过在并行计算平台内复制数据,以便可以使用高性能计算平台的独立并行分区同时处理同一目标数据集上的多个独立查询。 < / UnorderedList> ;使用了本发明的特定实施例d用于在通过以太网局域网连接到并行超级计算机的数据库系统上部署使用Smith-Waterman算法进行涉及基因和蛋白质序列匹配的生物信息学应用程序。

著录项

  • 公开/公告号US2009077011A1

    专利类型

  • 公开/公告日2009-03-19

    原文格式PDF

  • 申请/专利权人 RAMESH NATARAJAN;MICHAEL KOCHTE;

    申请/专利号US20070856130

  • 发明设计人 RAMESH NATARAJAN;MICHAEL KOCHTE;

    申请日2007-09-17

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 19:34:37

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号