【24h】

Learning Decision Trees from Distributed Datasets

机译:从分布式数据集中学习决策树

获取原文

摘要

Decision trees are an important data mining tool with many applications.Like many classification techniques,decision trees process the entire database in order to produce a generalization of the data that can be used subsequently for classification.Distributed databases are not amenable to such a global approach to generalization.This paper describes architecture of decision trees induction from distributed datasets which includes configuration manager retrieval data from distributed data,pruning data,and partial decision trees and data integration.In retrieval data,we explore a general strategy for explores a general strategy transforming traditional machine learning algorithms into algorithms for learning from distributed data;then we devise a pruning algorithms to optimal the data retrieval;finally we integrate the distributed sub-result data into final decision trees.
机译:决策树是具有许多应用程序的重要数据挖掘工具。与许多分类技术一样,决策树处理整个数据库以产生可随后用于分类的数据概括。分布式数据库不适合这种全局方法本文介绍了从分布式数据集归纳决策树的体系结构,包括配置管理器从分布式数据中检索数据,修剪数据以及部分决策树和数据集成。在检索数据中,我们探索了一种通用策略,用于探索通用策略的转换。将传统的机器学习算法集成到用于从分布式数据中学习的算法中;然后,我们设计出一种修剪算法以优化数据检索;最后,将分布式子结果数据集成到最终决策树中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号