A Practical Data-driven Framework for Parallel Data Mining


Abstract

In many practical applications, data mining results must be delivered quickly. To achieve the required efficiency without sacrificing the quality of the results, practitioners are now looking at ways to parallelize the most computationally expensive steps of the data mining process. Realizing that a complete rewriting of existing sequential programs into parallel ones is often too tedious and expensive, we propose a framework that reuses existing sequential programs to perform parallel data mining on a computer cluster. The proposed framework relies on the JavaParty system and can be used to parallelize both Java and non-Java programs. This paper details the framework, illustrates the implementation, and presents early experimental results showing the benefits of the approach.
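The core idea in the abstract, reusing an unmodified sequential program by running one copy per data partition, can be illustrated with a small sketch. This is not the authors' framework and does not use JavaParty: it is a single-machine stand-in in Python, where the "existing sequential program" is a hypothetical one-line command that sums integers from standard input, and the names `SEQUENTIAL_PROGRAM`, `run_sequential`, `split`, and `parallel_run` are all assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor
import subprocess
import sys

# Hypothetical stand-in for an existing sequential program: it reads integers
# from stdin and prints their sum. Any unmodified command-line tool that
# processes one chunk of data could take its place.
SEQUENTIAL_PROGRAM = [
    sys.executable, "-c",
    "import sys; print(sum(int(x) for x in sys.stdin.read().split()))",
]


def run_sequential(partition):
    """Run the unmodified sequential program on one data partition."""
    text = " ".join(str(v) for v in partition)
    result = subprocess.run(SEQUENTIAL_PROGRAM, input=text,
                            capture_output=True, text=True, check=True)
    return int(result.stdout.strip())


def split(data, parts):
    """Split data into at most `parts` contiguous, non-empty chunks."""
    size = max(1, -(-len(data) // parts))  # ceiling division
    return [data[i:i + size] for i in range(0, len(data), size)]


def parallel_run(data, workers=4):
    """Launch one sequential job per partition and collect results in order."""
    partitions = split(data, workers)
    # Threads only wait on child processes here, so the jobs run concurrently.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_sequential, partitions))


if __name__ == "__main__":
    partial_sums = parallel_run(list(range(1, 101)), workers=4)
    print(partial_sums, sum(partial_sums))
```

On a cluster, the thread pool would be replaced by whatever mechanism dispatches jobs to remote nodes; the sketch only shows that the sequential program itself is left untouched and that parallelism comes from partitioning the data.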

Bibliographic record

Source
World Multi-Conference on Systemics, Cybernetics and Informatics, 2005, 6 pages
Authors
Chunsheng Yang; Sylvain Letourneau; International Institute of Informatics and Systemics (IIIS)
Original format
PDF
Chinese Library Classification
Computing technology, computer technology
Keywords
parallel data mining; feature extraction; model evaluation or testing; JavaParty

Similar documents

1. High Performance Computation of Big Data: Performance Optimization Approach towards a Parallel Frequent Item Set Mining Algorithm for Transaction Data based on Hadoop MapReduce Framework [J]. Guru Prasad M S, Nagesh H R, Swathi Prabhu. International Journal of Intelligent Systems and Applications. 2017, No. 1
2. Legitimising data-driven models: Exemplification of a new data-driven mechanistic modelling framework [J]. Mount N.J., Dawson C.W., Abrahart R.J. Hydrology and Earth System Sciences. 2013, No. 7
3. Legitimising data-driven models: exemplification of a new data-driven mechanistic modelling framework [J]. Mount N. J., Dawson C. W., Abrahart R. J. Hydrology and Earth System Sciences. 2013, No. 7
4. A Practical Data-driven Framework for Parallel Data Mining [C]. Chunsheng Yang, Sylvain Letourneau. The 9th World Multi-Conference on Systemics, Cybernetics and Informatics (WMSCI 2005), vol. 4. 2005
5. Towards Practical Data-driven Network Design [D]. Li, Zhijing. 2019
6. COMBImage2: a parallel computational framework for higher-order drug combination analysis that includes automated plate design, matched filter based object counting and temporal data mining [O]. Efthymia Chantzi, Malin Jarvius, Mia Niklasson, 2019
7. A paralleled big data algorithm with MapReduce framework for mining Twitter data [O]. Bing L, Chan KCC. 2015
8. Mining of Multivariate Temporal Biological Data: A Framework for the Rational Design of Data-Driven Models [R]. Kamimura, R. T., Bicciato, S., Shimizu, H., 2001
