A distributed framewark for parallel data mining using HPJava

O Rana; D Fisk

首页> 外文期刊>BT Technology Journal >A distributed framewark for parallel data mining using HPJava

【24h】

A distributed framewark for parallel data mining using HPJava

机译：使用HPJava进行并行数据挖掘的分布式框架

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Java has become a language of choice for applications executing in heterogeneous environments utilising distributed objects and multithreading. To handle large data sets, scalable and efficient implementations of data mining approaches are required, generally employing computationally intensive algorithms. Conventional Java implementations do not directly provide support for the data structures often encountered in such algorithms, and they also lack repeatability in numerical precision across platforms. This paper describes a distributed framework employing task and data parallelism, and implemented in high performance Java (HPJava). Issues of interest for data mining algorithms are identified, and possible solutions discussed for overcoming limitations in the Java Virtual Machine. The framework supports parallelism across workstation clusters, using the message-passing interface as middleware, and can support different analysis algorithms, wrapped as Java objects, and linked to various databases using the Java database connectivity interface. Guidelines are provided for implementing parallel and distributed data mining on large data sets, and a proof-of-concept data mining application is analysed using a neural network.

机译：Java已成为在使用分布式对象和多线程的异构环境中执行的应用程序的选择语言。为了处理大型数据集，通常需要采用计算密集型算法，因此需要可伸缩且高效的数据挖掘方法实现。常规的Java实现不能直接为此类算法中经常遇到的数据结构提供支持，并且它们在跨平台的数值精度方面也缺乏可重复性。本文介绍了一种采用任务和数据并行性的分布式框架，并以高性能Java（HPJava）实现。确定了数据挖掘算法感兴趣的问题，并讨论了克服Java虚拟机限制的可能解决方案。该框架使用消息传递接口作为中间件，支持跨工作站集群的并行性，并且可以支持不同的分析算法，包装为Java对象，并使用Java数据库连接性接口链接到各种数据库。提供了在大型数据集上实现并行和分布式数据挖掘的指南，并使用神经网络分析了概念验证数据挖掘应用程序。

著录项

来源
《BT Technology Journal》 |1999年第3期|146-154|共9页
作者
O Rana; D Fisk;
展开▼
作者单位

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
入库时间 2022-08-18 00:29:02

相似文献

外文文献
中文文献
专利

1. An Optimized Distributed Association Rule Mining Algorithm in Parallel and Distributed Data Mining with XML Data for Improved Response Time [J] . Sujni Paul International Journal of Computer Science & Information Technology (IJCSIT) . 2010,第2期

机译：XML数据并行和分布式数据挖掘中的优化分布式关联规则挖掘算法，可提高响应时间
2. HPJAVA: A DATA PARALLEL PROGRAMMING ALTERNATIVE [J] . Bryan Carpenter, Geoffrey Fox Computing in science & engineering . 2003,第3期

机译：HPJAVA：数据并行编程替代方案
3. A special issue of Journal of Parallel and Distributed Computing: Models and algorithms for high-performance distributed data mining [J] . Alfredo Cuzzocrea Journal of Parallel and Distributed Computing . 2011,第5期

机译：《并行与分布式计算杂志》特刊：高性能分布式数据挖掘的模型和算法
4. HPJava based Parallel Data Mining Framework [C] . Omer Rana, Donald Fisk ISCA International Conference on Information Reuse and Integration . 1999

机译：基于HPJava的并行数据挖掘框架
5. Contributions to parallel and distributed computing in knowledge discovery and data mining. [D] . Lozano Inca, Elio. 2006

机译：在知识发现和数据挖掘中对并行和分布式计算的贡献。
6. Web Based Parallel/Distributed Medical Data Mining Using Software Agents [O] . Hillol Kargupta, Brian Stafford, Ilker Hamzaoglu 1997

机译：使用软件代理的基于Web的并行/分布式医学数据挖掘
7. AN OPTIMIZED DISTRIBUTED ASSOCIATION RULE MINING ALGORITHM IN PARALLEL AND DISTRIBUTED DATA MINING WITH XML DATA FOR IMPROVED RESPONSE TIME. [O] . 2011

机译：利用XmL数据优化分布式关联规则挖掘和分布式数据挖掘算法，提高响应时间。

A distributed framewark for parallel data mining using HPJava

摘要

著录项

相似文献

相关主题

期刊订阅