首页> 外文会议>World Multi-Conference on Systemics, Cybernetics and Informatics >A Practical Data-driven Framework for Parallel Data Mining
【24h】

A Practical Data-driven Framework for Parallel Data Mining

机译:一个实用的数据驱动框架,用于并行数据挖掘

获取原文

摘要

In many practical applications, data mining results must be quickly delivered. To achieve the required efficiency, without sacrificing the quality of the results, practitioners are now looking at ways to parallelize the most computationally expensive steps of the data mining process. Realizing mat a complete rewriting of existing sequential programs into parallel ones is often too tedious and expensive, we propose a framework which re-uses existing sequential programs to perform parallel data mining on a computer cluster. The proposed framework relies on the JavaParty system and can be used to parallelize both Java and non-Java programs. This paper details the framework, illustrates the implementation, and presents early experimental results showing the benefits of the approach.
机译:在许多实际应用中,必须快速交付数据挖掘结果。为了实现所需的效率,在不牺牲结果的质量,从业者现在正在寻找与数据挖掘过程中最具计算昂贵的步骤并行化的方法。实现垫将现有的顺序程序完全重写为并行的程序通常太繁琐且昂贵,我们提出了一个框架,该框架重新使用现有的顺序程序来执行计算机集群上的并行数据挖掘。所提出的框架依赖于javaParty系统,可用于并行化Java和非Java程序。本文详细说明了该框架,说明了实施,并提出了早期的实验结果,显示了这种方法的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号