...
首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Dynamic Load Balancing for the Distributed Mining of Molecular Structures
【24h】

Dynamic Load Balancing for the Distributed Mining of Molecular Structures

机译:分子结构分布式挖掘的动态负载平衡

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In molecular biology, it is often desirable to find common properties in large numbers of drug candidates. One family of methods stems from the data mining community, where algorithms to find frequent graphs have received increasing attention over the past years. However, the computational complexity of the underlying problem and the large amount of data to be explored essentially render sequential algorithms useless. In this paper, we present a distributed approach to the frequent subgraph mining problem to discover interesting patterns in molecular compounds. This problem is characterized by a highly irregular search tree, whereby no reliable workload prediction is available. We describe the three main aspects of the proposed distributed algorithm, namely, a dynamic partitioning of the search space, a distribution process based on a peer-to-peer communication framework, and a novel receiver-initiated load balancing algorithm. The effectiveness of the distributed method has been evaluated on the well-known National Cancer Institute's HIV-screening data set, where we were able to show close-to linear speedup in a network of workstations. The proposed approach also allows for dynamic resource aggregation in a nondedicated computational environment. These features make it suitable for large-scale, multidomain, heterogeneous environments, such as computational grids.
机译:在分子生物学中,通常希望找到大量候选药物的共同特性。数据挖掘社区提供了一种方法,在过去的几年中,用于查找频繁图形的算法受到越来越多的关注。但是,潜在问题的计算复杂性和要探索的大量数据实质上使顺序算法无用。在本文中,我们提出了一种解决频繁子图挖掘问题的分布式方法,以发现分子化合物中有趣的模式。此问题的特征在于高度不规则的搜索树,因此没有可靠的工作负载预测可用。我们描述了所提出的分布式算法的三个主要方面,即搜索空间的动态分区,基于对等通信框架的分发过程以及新颖的接收器启动的负载平衡算法。分布式方法的有效性已在著名的美国国家癌症研究所的HIV筛查数据集上进行了评估,在该数据集中,我们能够在工作站网络中显示接近线性的加速。所提出的方法还允许在非专用计算环境中进行动态资源聚合。这些功能使其适用于大规模,多域,异构环境,例如计算网格。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号