Study on algorithms of parallel and distributed data mining calculating process


Abstract

Building on distributed data mining, this paper introduces a parallel and distributed computing architecture that stores partitioned data on sub-nodes, combining the partitioned-database approach with an improved Apriori algorithm. It focuses on data skew in the distributed environment and proposes a converse clustering method to address it. Corresponding parallel and distributed data mining algorithms are designed for large-scale transaction databases, and their computation processes are described in detail. Because the data are processed after effective partitioning, efficient communication among nodes greatly reduces the volume of data transmitted. The proposed algorithms provide a flexible and extensible computing platform, reduce communication overhead, and scale well. They aim to perform network computing, and to exploit its advantages, on fairly inexpensive computers. The algorithms can be applied in large parallel, distributed, or single-computer environments.
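
The abstract does not spell out the algorithms, so the following is only a minimal sketch of the general idea it names: partition the transaction database, run Apriori locally on each partition, and exchange only candidate itemsets and counts rather than raw transactions. It follows the classic two-phase partition approach, not necessarily the authors' improved Apriori, and it does not model the converse clustering step for data skew. The function names (`local_frequent_itemsets`, `count_candidates`, `partitioned_apriori`) and the sample data are hypothetical, and the "nodes" are simulated with a plain loop.

```python
from collections import Counter
from itertools import combinations


def local_frequent_itemsets(partition, min_support):
    """Run Apriori on one partition (one simulated sub-node).

    Returns every itemset whose relative support within this
    partition is at least min_support.
    """
    min_count = min_support * len(partition)
    transactions = [frozenset(t) for t in partition]
    # Level 1: count single items.
    counts = Counter(frozenset([item]) for t in transactions for item in t)
    current = {s for s, c in counts.items() if c >= min_count}
    frequent = set(current)
    k = 2
    while current:
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        counts = Counter(c for t in transactions for c in candidates if c <= t)
        current = {s for s, c in counts.items() if c >= min_count}
        frequent |= current
        k += 1
    return frequent


def count_candidates(partition, candidates):
    """Count how many transactions in one partition contain each candidate."""
    transactions = [frozenset(t) for t in partition]
    return Counter({c: sum(1 for t in transactions if c <= t) for c in candidates})


def partitioned_apriori(partitions, min_support):
    """Two-phase mining over partitions; only itemsets and counts are shared."""
    # Phase 1: each node reports its locally frequent itemsets.
    # A globally frequent itemset must be locally frequent in some partition,
    # so the union is a complete set of global candidates.
    candidates = set()
    for part in partitions:
        candidates |= local_frequent_itemsets(part, min_support)
    # Phase 2: each node counts the shared candidates; counts are summed.
    total = Counter()
    for part in partitions:
        total.update(count_candidates(part, candidates))
    n = sum(len(part) for part in partitions)
    return {c: total[c] for c in candidates if total[c] >= min_support * n}


# Hypothetical data: two partitions of three transactions each.
partitions = [
    [["a", "b", "c"], ["a", "b"], ["a", "c"]],
    [["b", "c"], ["a", "b"], ["b"]],
]
result = partitioned_apriori(partitions, min_support=0.5)
for itemset, count in sorted(result.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(itemset), count)
```

In this sketch the second partition is skewed toward item `b`, so it contributes few local candidates; `{a, c}` is locally frequent in the first partition but is rejected in phase 2 because its global count falls below the threshold. Handling such skew more cleverly is the part the abstract attributes to converse clustering, which is not reproduced here.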


Bibliographic details

Source
2005 | pp. 2084-2089 | 6 pages
Authors
Ying-Wu Fang; Xiu-Bing Zhao; Guang-Peng Zhang; Yi Wang; Yi Sun; Yong-Fang Zhang
Original format
PDF
Chinese Library Classification
Industrial technology
Keywords
data mining; parallel algorithms; transaction processing; very large databases; converse clustering; data skew; distributed algorithm; improved Apriori algorithm; large-scale transaction database; network calculation; parallel algorithm; partition datab


Similar literature

1. An Optimized Distributed Association Rule Mining Algorithm in Parallel and Distributed Data Mining with XML Data for Improved Response Time [J]. Sujni Paul. International Journal of Computer Science & Information Technology (IJCSIT), 2010, No. 2.
2. A special issue of Journal of Parallel and Distributed Computing: Models and algorithms for high-performance distributed data mining [J]. Alfredo Cuzzocrea. Journal of Parallel and Distributed Computing, 2011, No. 5.
3. A Bioinformatics-Inspired Adaptation to Ukkonen’s Edit Distance Calculating Algorithm and Its Applicability Towards Distributed Data Mining [J]. Johnson Bruce. Journal of Software Engineering and Applications, 2008, No. 1.
4. Study on algorithms of parallel and distributed data mining calculating process [C]. Ying-Wu Fang, Xiu-Bing Zhao, Guang-Peng Zhang. International Conference on Machine Learning and Cybernetics, 2005.
5. Parallel processing of best-first branch and bound algorithms on distributed memory multiprocessors [D]. Abdel-Rahman, Tarek Saad. 1989.
6. affyPara—a Bioconductor Package for Parallelized Preprocessing Algorithms of Affymetrix Microarray Data [O] . Markus Schmidberger, Esmeralda Vicedo, Ulrich Mansmann 2009
7. An Optimized Distributed Association Rule Mining Algorithm in Parallel and Distributed Data Mining with XML Data for Improved Response Time [O]. 2011.
