首页> 外文会议>IEEE International Conference on Cloud Computing Technology and Science >Intelligent MapReduce Based Framework for Labeling Instances in Evolving Data Stream
【24h】

Intelligent MapReduce Based Framework for Labeling Instances in Evolving Data Stream

机译:基于智能MapReduce在不断发展的数据流中的标记实例的框架

获取原文

摘要

In our current work, we have proposed a multi-tiered ensemble based robust method to address all of the challenges of labeling instances in evolving data stream. Bottleneck of our current work is, it needs to build ADABOOST ensembles for each of the numeric features. This can face scalability issue as number of features can be very large at times in data stream. In this paper, we propose an intelligent approach to build these large number of ADABOOST ensembles with MapReduce based parallelism. We show that, this approach can help our base method to achieve significant scalability without compromising classification accuracy. We analyze different aspects of our design to depict advantages and disadvantages of the approach. We also compare and analyze performance of the proposed approach in terms of execution time, speedup and scale up.
机译:在我们当前的工作中,我们提出了一种基于多分层的合奏的强大方法,可以解决标签实例在不断发展的数据流方面的所有挑战。我们当前工作的瓶颈是,它需要为每个数字功能构建Adaboost合奏。这可以面临可扩展性问题,因为在数据流中的时间数量可能非常大。在本文中,我们提出了一种智能方法来构建与基于MapReduce的并行性的大量Adaboost合奏。我们表明,这种方法可以帮助我们的基础方法实现显着的可扩展性,而不会影响分类准确性。我们分析了我们设计的不同方面,以描绘该方法的优缺点。我们还在执行时间,加速和扩展方面进行比较和分析所提出的方法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号