首页> 外文期刊>International Journal of Distributed and Parallel Systems >Highly Scalable, Parallel and Distributed Adaboost Algorithm Using Light Weight Threads and Web Services on A Network of Multi-Core Machines
【24h】

Highly Scalable, Parallel and Distributed Adaboost Algorithm Using Light Weight Threads and Web Services on A Network of Multi-Core Machines

机译:在多核计算机网络上使用轻量级线程和Web服务的高度可扩展,并行和分布式Adaboost算法

获取原文
           

摘要

AdaBoost is an important algorithm in machine learning and is being widely used in object detection. AdaBoost works by iteratively selecting the best amongst weak classifiers, and then combines several weak classifiers to obtain a strong classifier. Even though AdaBoost has proven to be very effective, its learning execution time can be quite large depending upon the application e.g., in face detection, the learning time can be several days. Due to its increasing use in computer vision applications, the learning time needs to be drastically reduced so that an adaptive near real time object detection system can be incorporated. In this paper, we develop a hybrid parallel and distributed AdaBoost algorithm that exploits the multiple cores in a CPU via light weight threads, and also uses multiple machines via a web service software architecture to achieve high scalability. We present a novel hierarchical web services based distributed architecture and achieve nearly linear speedup up to the number of processors available to us. In comparison with the previously published work, which used a single level master-slave parallel and distributed implementation [1] and only achieved a speedup of 2.66 on four nodes, we achieve a speedup of 95.1 on 31 workstations each having a quad-core processor, resulting in a learning time of only 4.8 seconds per feature.
机译:AdaBoost是机器学习中的重要算法,并已广泛用于对象检测。 AdaBoost的工作方式是在弱分类器中反复选择最佳分类器,然后组合多个弱分类器以获得强分类器。即使AdaBoost已被证明非常有效,但其学习执行时间仍可能会很大,具体取决于应用程序(例如在人脸检测中)的学习时间可能为几天。由于其在计算机视觉应用中的越来越多的使用,学习时间需要大大减少,以便可以合并自适应近实时物体检测系统。在本文中,我们开发了一种混合并行和分布式AdaBoost算法,该算法通过轻量级线程利用CPU中的多个内核,并通过Web服务软件体系结构使用多台计算机来实现高可伸缩性。我们提出了一种基于分布式体系结构的新型分层Web服务,并实现了几乎线性的加速,达到了可用处理器的数量。与先前发布的使用单级主从并行和分布式实现[1]的工作相比,仅在四个节点上实现了2.66的加速,相比之下,我们在31个具有四核处理器的工作站上实现了95.1的加速,因此每个功能的学习时间仅为4.8秒。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号