首页> 外文学位 >Research on distributed data mining system and algorithm based on multi-agent.
【24h】

Research on distributed data mining system and algorithm based on multi-agent.

机译:基于多智能体的分布式数据挖掘系统与算法研究。

获取原文
获取原文并翻译 | 示例

摘要

Data mining means extracting hidden, previous unknown knowledge and rules with potential value to decision from mass data in database. Association rule mining is a main researching area of data mining area, which is widely used in practice. With the development of network technology and the improvement of level of IT application, distributed database is commonly used. Distributed data mining is mining overall knowledge which is useful for management and decision from database distributed in geography. It has become an important issue in data mining analysis. Distributed data mining can achieve a mining task with computers in different site on the Internet. It can not only improve the mining efficiency, reduce the transmitting amount of network data, but is also good for security and privacy of data. Based on related theories and current research situation of data mining and distributed data mining, this thesis will focus on analysis on the structure of distributed mining system and distributed association rule mining algorithm.;Key words: data mining, distributed, Association rule, multi-agent , RK-tree algorithm;This thesis first raises a structure of distributed data mining system which is base on multi-agent. It adopts star network topology, and realize distributed saving mass data mining with multi-agent. Based on raised distributed data mining system, this these brings about a new distributed association rule mining algorithm---RK-tree algorithm. RK-tree algorithm is based on the basic theory of twice knowledge combination. Each sub-site point first mines local frequency itemset from local database, then send the mined local frequency itemset to the main site point. The main site point combines those local frequency itemset and get overall candidate frequency itemset, and send the obtained overall candidate frequency itemset to each sub-site point. Each sub-site point count the supporting rate of those overall candidate frequency itemset and sent it back to the main site point. At last, the main site point combines the results sent by sub-site point and gets the overall frequency itemset and overall associtation rule. This algorithm just needs three times communication between the main and sub-site points, which greatly reduces the amount and times of communication, and improves the efficiency of selection. What's more, each sub-site point can fully use existing good centralized association rule mining algorithm to realize local association rule mining, which can enable them to obtain better local data mining efficiency, as well as reduce the workload. This algorithm is simple and easy to realize. The last part of this thesis is the conclusion of the analysis, as well as the direction of further research.
机译:数据挖掘意味着从数据库中的海量数据中提取具有潜在价值的,隐藏的,先前未知的知识和规则,以供决策。关联规则挖掘是数据挖掘领域的主要研究领域,在实践中得到了广泛的应用。随着网络技术的发展和IT应用水平的提高,分布式数据库已成为人们普遍使用的数据库。分布式数据挖掘是挖掘整体知识,这对于从地理分布的数据库进行管理和决策很有用。它已成为数据挖掘分析中的重要问题。分布式数据挖掘可以使用Internet上不同站点中的计算机来实现挖掘任务。它不仅可以提高挖掘效率,减少网络数据的传输量,而且对数据的安全性和私密性也有好处。本文根据相关理论和数据挖掘与分布式数据挖掘的研究现状,着重分析分布式挖掘系统的结构和分布式关联规则挖掘算法。关键词:数据挖掘分布式关联规则多重挖掘Agent,RK-tree算法;本文首先提出了一种基于多Agent的分布式数据挖掘系统的结构。它采用星型网络拓扑结构,并实现了多主体的分布式节省海量数据挖掘。这些基于改进的分布式数据挖掘系统,带来了一种新的分布式关联规则挖掘算法-RK-tree算法。 RK-tree算法基于两次知识组合的基本理论。每个子站点点首先从本地数据库中挖掘本地频率项集,然后将挖掘的本地频率项集发送到主站点点。主站点点结合那些局部频率项集并获得整体候选频率项集,并将获得的整体候选频率项集发送到每个子站点点。每个子站点点计算这些整体候选频率项集的支持率,并将其发送回主要站点点。最后,主站点结合子站点发送的结果,得到整体频率项集和整体关联规则。该算法只需要在主站点和子站点之间进行三遍通信即可,大大减少了通信量和通信时间,提高了选择效率。而且,每个子站点可以充分利用现有的良好的集中式关联规则挖掘算法来实现局部关联规则挖掘,从而使他们可以获得更好的局部数据挖掘效率,并减少工作量。该算法简单易实现。本文的最后一部分是分析的结论以及进一步研究的方向。

著录项

  • 作者

    Jiang, Lingxia.;

  • 作者单位

    Universite du Quebec a Chicoutimi (Canada).;

  • 授予单位 Universite du Quebec a Chicoutimi (Canada).;
  • 学科 Computer Science.
  • 学位 M.Sc.
  • 年度 2009
  • 页码 97 p.
  • 总页数 97
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:38:18

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号