...
首页> 外文期刊>Complex & Intelligent Systems >An insight into imbalanced Big Data classification: outcomes and challenges
【24h】

An insight into imbalanced Big Data classification: outcomes and challenges

机译:大数据分类失衡的见解:成果和挑战

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Abstract Big Data applications are emerging during the last years, and researchers from many disciplines are aware of the high advantages related to the knowledge extraction from this type of problem. However, traditional learning approaches cannot be directly applied due to scalability issues. To overcome this issue, the MapReduce framework has arisen as a “de facto” solution. Basically, it carries out a “divide-and-conquer” distributed procedure in a fault-tolerant way to adapt for commodity hardware. Being still a recent discipline, few research has been conducted on imbalanced classification for Big Data. The reasons behind this are mainly the difficulties in adapting standard techniques to the MapReduce programming style. Additionally, inner problems of imbalanced data, namely lack of data and small disjuncts, are accentuated during the data partitioning to fit the MapReduce programming style. This paper is designed under three main pillars. First, to present the first outcomes for imbalanced classification in Big Data problems, introducing the current research state of this area. Second, to analyze the behavior of standard pre-processing techniques in this particular framework. Finally, taking into account the experimental results obtained throughout this work, we will carry out a discussion on the challenges and future directions for the topic.
机译:摘要近年来,大数据的应用正在兴起,许多学科的研究人员意识到与从此类问题中提取知识相关的巨大优势。但是,由于可伸缩性问题,传统的学习方法无法直接应用。为了克服这个问题,MapReduce框架已成为一种“事实上的”解决方案。基本上,它以容错的方式执行“分而治之”的分布式过程,以适应商品硬件。仍然是一门新兴学科,关于大数据的不平衡分类的研究很少。其背后的原因主要是使标准技术难以适应MapReduce编程风格的困难。此外,在数据分区以适应MapReduce编程风格的过程中,数据不平衡的内部问题(即缺少数据和小的分离)更加突出。本文是根据三个主要支柱进行设计的。首先,介绍大数据问题中分类不平衡的第一个结果,介绍该领域的当前研究状态。其次,分析此特定框架中标准预处理技术的行为。最后,考虑到整个工作中获得的实验结果,我们将就该主题所面临的挑战和未来方向进行讨论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号