首页> 外文期刊>Knowledge and information systems >Modeling and implementing distributed data mining strategies in JaCa-DDM
【24h】

Modeling and implementing distributed data mining strategies in JaCa-DDM

机译:jaca-ddm中的模型和实施分布式数据挖掘策略

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This work introduces JaCa-DDM, a novel distributed data mining system founded on the agents and artifacts paradigm, conceived to design, implement, deploy, and evaluate learning strategies. Jason rational agents conform to such strategies to cope with distributed computing environments, where CArtAgO artifacts encapsulate learning algorithms, data sources, evaluation tools, and other services implemented in Weka for data mining tasks. The set of strategies presented in this paper aims at encouraging the use of JaCa-DDM to develop new ones, suited to different needs. For this, our system provides tools to evaluate the resulting models in terms of accuracy, number of instances employed to learn, time of convergence, and volume of communications. Although the emphasis in decision trees, JaCa-DDM can be easily extended by adopting new artifacts, e.g., for meta-learning. The main contributions of the paper are as follows: (i) From the multi-agent systems perspective, our approach illustrates how to exploit the so-called "agentification" of Weka for the sake of code reusability, while preserving the benefits of reasoning at the Belief-Desire-Intention level with Jason; (ii) from the data mining perspective, JaCa-DDM is promoted as an extensible tool to define and test distributed strategies; and (iii) a set of strategies including centralizing, meta-learning and Windowing-based approaches, is carefully analyzed to provide comparisons among them.
机译:这项工作介绍了Jaca-DDM,这是一个新颖的分布式数据挖掘系统,创立在代理商和文物范式范式上,设想设计,实施,部署和评估学习策略。 Jason Rational Agent符合应对分布式计算环境的这种策略,其中Cartago工件封装了Weka中实现的学习算法,数据源,评估工具以及用于数据挖掘任务的其他服务。本文提出的一系列策略旨在鼓励使用Jaca-DDM开发新的,适合不同的需求。为此,我们的系统提供了在学习,收敛时间和通信量的准确性下评估所产生的模型的工具。虽然决策树的重点,但是通过采用新的伪像,例如用于元学习,可以轻松扩展Jaca-DDM。本文的主要贡献如下:(i)从多代理系统的角度来看,我们的方法是如何利用Weka的所谓“代理程序”,以获得代码可重用性,同时保留推理的好处杰森的信念欲望 - 意图水平; (ii)从数据挖掘角度来看,Jaca-DDM被推广为可扩展工具来定义和测试分布式策略; (iii)一系列策略包括集中,元学习和基于窗口的方法,经过仔细分析,以便在其中提供比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号