首页> 外文期刊>Indian Journal of Science and Technology >Implementation Analysis of Binaural Audio Crosstalk Cancellation on Heterogeneous Parallel Computing platforms using Mixed Non-Uniform Partitioned Convolution
【24h】

Implementation Analysis of Binaural Audio Crosstalk Cancellation on Heterogeneous Parallel Computing platforms using Mixed Non-Uniform Partitioned Convolution

机译:基于混合非均匀分区卷积的异构并行计算平台上双耳音频串扰消除的实现分析

获取原文
           

摘要

As general DSP processor architectures don’t have massive parallel architecture, they are not suitable to implement 3D audio virtual techniques at very long filters due to computational problems. To address these implementation issues of very long filters, an efficient method called Mixed Non-uniform Partitioned Convolution is proposed in this paper for implementing binaural audio crosstalk cancellation on heterogeneous parallel computing platforms. By using massive parallel architecture of heterogeneous platforms, the proposed approach is able to solve computational problems even at filter lengths of 65536 (32-bit floating point). The partitioning scheme followed in this paper is explained in detail to schedule partitions on various compute units of GPU device. The proposed approach was implemented on AMD GPUs using task parallel concept. The instruction level optimization was also provided for complex frequency multiplication and addition using OpenCL. The performance of this approach is compared against the existing techniques proposed by Garcia and Gardener. The cost vs. computational performance tradeoff comparison was given between proposed approach and existing methods. The comparison clearly shows that proposed approach is very efficient at very long filters and requires reasonable cost of implementation in terms of number of compute units. The combination of instruction level and algorithmic level optimizations make the proposed approach more suitable for implementation of not only stereo inputs based audio CTC but also multichannel inputs, particularly at very long filter lengths on parallel computing platforms.
机译:由于一般的DSP处理器架构没有大规模的并行架构,因此由于计算问题,它们不适合在很长的滤波器上实现3D音频虚拟技术。为了解决超长滤波器的这些实现问题,本文提出了一种有效的方法,称为混合非均匀分区卷积,用于在异构并行计算平台上实现双耳音频串扰消除。通过使用异构平台的大规模并行体系结构,即使在滤波器长度为65536(32位浮点)的情况下,所提出的方法也能够解决计算问题。本文详细介绍了分区方案,以计划在GPU设备的各种计算单元上进行分区。拟议的方法是使用任务并行概念在AMD GPU上实现的。还为使用OpenCL进行复杂的频率乘法和加法提供了指令级优化。将这种方法的性能与Garcia和Gardener提出的现有技术进行了比较。在提议的方法和现有方法之间进行了成本与计算性能折衷的比较。比较清楚地表明,所提出的方法在很长的滤波器上非常有效,并且就计算单元的数量而言需要合理的实施成本。指令级和算法级优化的结合使所提出的方法不仅更适合于基于立体声输入的音频CTC的实现,而且更适合于多通道输入的实现,尤其是在并行计算平台上非常长的滤波器长度下。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号