首页> 外文期刊>Journal of signal processing systems for signal, image, and video technology >Data Flow Algorithms for Processors with Vector Extensions
【24h】

Data Flow Algorithms for Processors with Vector Extensions

机译:具有向量扩展的处理器的数据流算法

获取原文
获取原文并翻译 | 示例
       

摘要

Full use of the parallel computation capabilities of present and expected CPUs and GPUs requires use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that impose serial dependencies between actor invocations that make vectorizing across actor invocations impossible. Ideally, issues of inter-thread coordination required by serial data dependencies should be handled by code written by parallel programming experts that is separate from code specifying signal processing operations. The purpose of this paper is to present one approach for so doing in the case of actors that maintain state. We propose a methodology for using the parallel scan (also known as prefix sum) pattern to create algorithms for multiple simultaneous invocations of such an actor that results in vectorizable code. Two examples of applying this methodology are given: (1) infinite impulse response filters and (2) finite state machines. The correctness and performance of the resulting IIR filters and one class of FSMs are studied.
机译:充分利用当前和预期的CPU和GPU的并行计算功能需要使用向量扩展。然而,用于数字信号处理的数据流系统中的许多参与者都具有内部状态(或等效地,从参与者循环回自身的边缘),从而在参与者调用之间施加了串行依赖性,从而使得无法跨参与者调用进行矢量化。理想情况下,串行数据相关性所需的线程间协调问题应由并行编程专家编写的代码处理,该代码应与指定信号处理操作的代码分开。本文的目的是提出一种在行为者保持状态的情况下这样做的方法。我们提出了一种使用并行扫描(也称为前缀和)模式来创建算法的方法,该算法可多次调用此类参与者,从而产生可矢量化的代码。给出了应用这种方法的两个例子:(1)无限冲激响应滤波器和(2)有限状态机。研究了所得IIR滤波器和一类FSM的正确性和性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号