首页> 外文会议>Annual KES conference >An Analysis of Clustering Approaches to Distributed Learning on Heterogeneously Distributed Datasets
【24h】

An Analysis of Clustering Approaches to Distributed Learning on Heterogeneously Distributed Datasets

机译:异构数据集上分布式学习的聚类方法分析

获取原文

摘要

Advances in communication technologies have contributed to the proliferation of distributed datasets. The most effective approach to distributed learning is to learn locally and then combine the local models. In general, distributed algorithms assume that there is a single model that could be induced from the distributed datasets. Under this view, distribution is treated exclusively as a technical issue. However, real-world distributed datasets frequently present an intrinsic data skewness among their partitions. Despite of its importance, up to the authors' knowledge, its impact has been barely investigated in the literature. In this paper, the performance of different cluster-based distributed learning methods is analyzed over distinct scenarios by incrementing the differences in the probabilistic distribution of data among partitions. Based on these results the best approach is suggested at every scenario.
机译:通信技术的进步促进了分布式数据集的扩散。分布式学习的最有效方法是在本地学习,然后结合本地模型。通常,分布式算法假定可以从分布式数据集中导出一个模型。在这种观点下,分发仅被视为技术问题。但是,现实世界中的分布式数据集经常在其分区之间呈现出固有的数据偏斜。尽管其重要性,据作者所知,其影响尚未在文献中进行过研究。在本文中,通过增加分区之间数据概率分布的差异,分析了在不同情况下不同基于群集的分布式学习方法的性能。根据这些结果,在每种情况下都建议最好的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号