首页> 外文会议>Parallel processing for imaging applications >Performance Evaluation of Canny Edge Detection on a Tiled Multicore Architecture
【24h】

Performance Evaluation of Canny Edge Detection on a Tiled Multicore Architecture

机译:平铺多核架构上Canny边缘检测的性能评估

获取原文
获取原文并翻译 | 示例

摘要

In the last few years, a variety of multicore architectures have been used to parallelize image processing applications. In this paper, we focus on assessing the parallel speed-ups of different Canny edge detection parallelization strategies on the Tile64, a tiled multicore architecture developed by the Tilera Corporation. Included in these strategies are different ways Canny edge detection can be parallelized, as well as differences in data management. The two parallelization strategies examined were loop-level parallelism and domain decomposition. Loop-level parallelism is achieved through the use of OpenMP,1 and it is capable of parallelization across the range of values over which a loop iterates. Domain decomposition is the process of breaking down an image into subimages, where each subimage is processed independently, in parallel. The results of the two strategies show that for the same number of threads, programmer implemented, domain decomposition exhibits higher speed-ups than the compiler managed, loop-level parallelism implemented with OpenMP.
机译:在最近几年中,已经使用了多种多核体系结构来并行化图像处理应用程序。在本文中,我们着重于评估Tilera公司开发的Tileing多核体系结构Tile64上不同Canny边缘检测并行化策略的并行加速性能。这些策略包括Canny边缘检测可以并行化的不同方式,以及数据管理方面的差异。研究的两种并行化策略是循环级并行化和域分解。循环级并行性是通过使用OpenMP [1]来实现的,它能够跨一个循环进行迭代的值范围进行并行化。域分解是将图像分解为子图像的过程,其中每个子图像都独立,并行地进行处理。两种策略的结果表明,对于相同数量的线程(由程序员实现),与使用OpenMP实现的编译器管理的循环级并行性相比,域分解显示出更高的速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号