首页> 外文会议>International Conference on High-Performance and Embedded Architectures and Compilers >Parallelization Schemes for Memory Optimization on the Cell Processor: A Case Study on the Harris Corner Detector
【24h】

Parallelization Schemes for Memory Optimization on the Cell Processor: A Case Study on the Harris Corner Detector

机译:电池处理器上存储器优化的并行化方案:哈里斯角探测器的案例研究

获取原文

摘要

The Cell processor is a typical example of a heterogeneous multiprocessor on-chip architecture that uses several levels of parallelism to deliver high performance. Reducing the gap between peak performance and effective performance is the challenge for software tool developers and the application developers. Image processing and media applications are typical "main stream" applications. We use the Harris algorithm for the detection of interest points in an image as a benchmark to compare the performance of several parallel schemes on a Cell processor. The impact of the DMA controlled data transfers and the synchronizations between SPEs explains the differences between the performance of the different parallelization schemes. The scalability of the architecture is modeled and evaluated.
机译:单元处理器是异构多处理器片上架构的典型示例,其使用多个水平的并行性来提供高性能。降低峰值性能与有效性能之间的差距是软件工具开发人员和应用程序开发人员的挑战。图像处理和媒体应用是典型的“主流”应用程序。我们使用Harris算法检测图像中的感兴趣点作为基准,以比较细胞处理器上的几个并行方案的性能。 DMA控制数据传输的影响和SPE之间的同步解释了不同并行化方案的性能之间的差异。建模和评估架构的可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号