首页> 外文会议>2011 17th IEEE International Conference on Parallel and Distributed Systems >Classifier Grouping to Enhance Data Locality for a Multi-threaded Object Detection Algorithm
【24h】

Classifier Grouping to Enhance Data Locality for a Multi-threaded Object Detection Algorithm

机译:分类器分组以增强数据局部性的多线程对象检测算法

获取原文

摘要

Object detection has become an enabling function for modern smart embedded devices to perform intelligent applications and interact with the environment appropriately and promptly. However, the limited computation resource of embedded devices has become a barrier to execute the computation intensive object detection algorithm. Leveraging the multi-threading scheme on embedded multi-core systems provides an opportunity to boost the performance. However, the memory bottleneck limits the performance scalability. Improving data locality of applications and maximizing the data reuse for on-chip caches have therefore become critical design concerns. This paper comprehensively analyzes the memory behavior and data locality of a multi-threaded object detection algorithm. A novel Classifier-Grouping scheme is proposed to significantly enhance the data reuse for on-chip caches of embedded multicore systems. By executing a multi-threaded object detection algorithm on a cycle-accurate multi-core simulator, the proposed approach can achieve up to 62% better performance when compared with the original parallel program.
机译:对象检测已成为现代智能嵌入式设备执行智能应用程序并与环境进行适当,迅速交互的一种启用功能。然而,嵌入式设备有限的计算资源已经成为执行计算密集型对象检测算法的障碍。在嵌入式多核系统上利用多线程方案可提供提高性能的机会。但是,内存瓶颈限制了性能可伸缩性。因此,改善应用程序的数据局部性和最大程度地提高片上高速缓存的数据重用性已成为关键的设计关注点。本文全面分析了多线程对象检测算法的内存行为和数据局部性。提出了一种新颖的分类器分组方案,以显着提高嵌入式多核系统的片上高速缓存的数据重用性。通过在周期精确的多核模拟器上执行多线程对象检测算法,与原始并行程序相比,所提出的方法可以实现高达62%的更好性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号