【24h】

Combining Distributed Classifies by Stacking

机译:通过堆叠组合分布式分类

获取原文

摘要

Many current mining tasks analyze data in environments with distributed computing nodes. Classification in such scenario needs to perform local mining task in each data site and then integrate local classifiers to a global model of the data. However, integration strategy can influence the performance and complexity of the final model. In this paper, based on the formalization of combining multiple classifiers by stacking in Distributed Data Mining, a new strategy to from meta-level training set is proposed, which can describe the vote made by each base-level classifiers. The experiment results show that our method achieve better performance for those datasets with highly skewed class distribution.
机译:许多当前挖掘任务分析具有分布式计算节点的环境中的数据。在这种情况下的分类需要在每个数据站点中执行本地挖掘任务,然后将本地分类器集成到数据的全局模型中。但是,集成策略可以影响最终模型的性能和复杂性。本文基于在分布式数据挖掘中堆叠多个分类器的形式化,提出了一种新的来自元级训练集的策略,可以描述每个基级分类器所做的投票。实验结果表明,我们的方法对具有高度偏斜类分布的数据集来实现更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号