首页> 外文会议>IEEE International Conference on Fuzzy Systems >Chi-Spark-RS: An Spark-built evolutionary fuzzy rule selection algorithm in imbalanced classification for big data problems
【24h】

Chi-Spark-RS: An Spark-built evolutionary fuzzy rule selection algorithm in imbalanced classification for big data problems

机译:Chi-Spark-RS:Spark构建的大数据问题不平衡分类中的进化模糊规则选择算法

获取原文

摘要

The significance and benefits of addressing classification tasks in Big Data applications is beyond any doubt. To do so, learning algorithms must be scalable to cope with such a high volume of data. The most suitable option to reach this objective is by using a MapReduce programming scheme, in which algorithms are automatically executed in a distributed and fault tolerant way. Among different available tools that support this framework, Spark has emerged as a “de facto” solution when using iterative approaches. In this work, our goal is to design and implement an Evolutionary Fuzzy Rule Selection algorithm within a Spark environment. To do so, we build different local rule bases within each Map Task that are later optimized by means of a genetic process. With this procedure, we seek to minimize the total number of rules that are gathered by each Reduce task to obtain a compact and accurate Fuzzy Rule Based Classification System. In particular, we set the experimental framework in the scenario of imbalanced classification. Therefore, the final objective will be analyzing the best synergy between the novel Evolutionary Fuzzy Rule Selection algorithm and the solutions applied to cope with skewed class distributions, namely cost-sensitive learning, random under-sampling and random-oversampling.
机译:毫无疑问,解决大数据应用程序中的分类任务的重要性和好处。为此,学习算法必须可扩展以应对如此大量的数据。实现此目标的最合适选择是使用MapReduce编程方案,在该方案中,算法将以分布式且容错的方式自动执行。在支持该框架的各种可用工具中,Spark在使用迭代方法时已成为“事实”解决方案。在这项工作中,我们的目标是在Spark环境中设计和实现一种进化模糊规则选择算法。为此,我们在每个“地图任务”中建立了不同的本地规则库,随后通过遗传过程对其进行了优化。通过此过程,我们力求最小化每个Reduce任务收集的规则总数,以获得紧凑且准确的基于模糊规则的分类系统。特别是,我们在不平衡分类的情况下设置了实验框架。因此,最终目标将是分析新颖的进化模糊规则选择算法与用于解决倾斜的类分布的解决方案(即成本敏感型学习,随机欠采样和随机过采样)之间的最佳协同作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号