首页> 外文学位 >Agent-Based Cooperative Heterogeneous Data Mining.
【24h】

Agent-Based Cooperative Heterogeneous Data Mining.

机译:基于代理的协作异构数据挖掘。

获取原文
获取原文并翻译 | 示例

摘要

This thesis presents an agent-based cooperative data mining model named CoLe2. CoLe 2 is targeted at performing data mining on large, heterogeneous data sets. It employs multiple different types of data mining algorithms, enables cooperations among these algorithms, and produces combined results in the form of rules.;CoLe2 is a multi-agent system with three types of agents that have the different roles of running data mining algorithms, performing combination of mining results, and driving the entire CoLe2 system work flow with knowledge-based strategies, respectively. The system has a work flow with two levels of loops. The outer loop performs data selection, mining algorithm selection and expectation adjustment strategies. The inner loop performs data mining execution and result combination, with additional knowledge-based strategies implemented in the agents. The agents exchange useful information during the running of the work flow to help each other.;A prototype system of the CoLe2 model is described. This prototype contains four different data mining algorithms (a classification algorithm, a sequence mining algorithm, an association rules mining algorithm and a descriptive mining algorithm), two combination strategies and instantiations of the knowledge-based strategies. The strategies instantiations include data selection based on a clustering algorithm, an asynchronous work flow for better turnaround time, relevance factor calculation, fuzzy condition matching, prediction histogram based rule similarity and rule grouping.;Experiments have been performed with two data sets -- a medium-sized data set of billing data from Calgary Health Region, and a large data set from the Alberta Kidney Disease Network. The experimental results show advantages of CoLe2 over individual data mining algorithms in terms of efficiency and result quality, as well as advantages over the CoLe model with only one level of work flow. Specialized experiments also prove the effectiveness of individual knowledge-based strategies.
机译:本文提出了一种基于代理的协作数据挖掘模型CoLe2。 CoLe 2旨在对大型异构数据集执行数据挖掘。它采用多种不同类型的数据挖掘算法,可以在这些算法之间进行协作,并以规则的形式产生组合结果。; CoLe2是一个具有三种类型的代理的多代理系统,它们具有运行数据挖掘算法的不同角色,执行挖掘结果的组合,并分别基于基于知识的策略来驱动整个CoLe2系统的工作流程。该系统具有两个循环级别的工作流程。外循环执行数据选择,挖掘算法选择和期望调整策略。内部循环执行数据挖掘执行和结果组合,并在代理中实施其他基于知识的策略。代理在工作流程运行期间交换有用的信息以互相帮助。;描述了CoLe2模型的原型系统。该原型包含四种不同的数据挖掘算法(分类算法,序列挖掘算法,关联规则挖掘算法和描述性挖掘算法),两种组合策略以及基于知识的策略的实例化。策略实例包括基于聚类算法的数据选择,更好的周转时间的异步工作流程,相关因子计算,模糊条件匹配,基于预测直方图的规则相似性和规则分组;已经对两个数据集进行了实验-中型数据集来自卡尔加里健康地区,而大数据集来自艾伯塔肾脏疾病网络。实验结果表明,在效率和结果质量方面,CoLe2优于单独的数据挖掘算法,并且仅在一个工作流程级别上优于CoLe模型。专门的实验还证明了基于个人知识的策略的有效性。

著录项

  • 作者

    Gao, Jie.;

  • 作者单位

    University of Calgary (Canada).;

  • 授予单位 University of Calgary (Canada).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 161 p.
  • 总页数 161
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号