首页> 外文会议>International conference on algorithms and architectures for parallel processing >A Parallel Fast Fourier Transform Algorithm for Large-Scale Signal Data Using Apache Spark in Cloud
【24h】

A Parallel Fast Fourier Transform Algorithm for Large-Scale Signal Data Using Apache Spark in Cloud

机译:云中使用Apache Spark并行处理大规模信号数据的快速傅立叶变换算法

获取原文

摘要

In the field of signal process, Fast Fourier Transform (FFT) is a widely used algorithm to transform signal data from time to frequency. Unfortunately, with the exponential growth of data, traditional methods cannot meet the demand of large-scale computation on these big data because of three main challenges of large-scale FFT, i.e., big data size, real-time data processing and high utilization of compute resources. To satisfy these requirements, an optimized FFT algorithm in Cloud is deadly needed. In this paper, we introduce a new method to conduct FFT in Cloud with the following contributions: first, we design a parallel FFT algorithm for large-scaled signal data in Cloud; second, we propose a MapReduce-based mechanism to distribute data to compute nodes using big data processing framework; third, an optimal method of distributing compute resources is implemented to accelerate the algorithm by avoiding redundant data exchange between compute nodes. The algorithm is designed in MapReduce computation framework which contains three steps: data preprocessing, local data transform and parallel data transform to integrate processing results. The parallel FFT is implemented in a 16-node Cloud to process real signal data The experimental results reveal an obvious improvement in the algorithm speed. Our parallel FFT is approximately five times faster than FFT in Matlab in when the data size reaches 10 GB.
机译:在信号处理领域,快速傅立叶变换(FFT)是一种广泛使用的算法,可以将信号数据从时间转换为频率。不幸的是,随着数据的指数增长,传统方法无法满足对这些大数据进行大规模计算的需求,这是因为大规模FFT存在三个主要挑战,即大数据量,实时数据处理和高利用率。计算资源。为了满足这些要求,迫切需要一种在Cloud中优化的FFT算法。本文介绍了一种在Cloud中进行FFT的新方法,主要有以下贡献:首先,针对Cloud中的大规模信号数据,设计了并行FFT算法。其次,我们提出了一种基于MapReduce的机制,可以使用大数据处理框架将数据分发到计算节点。第三,实现了一种优化的分配计算资源的方法,通过避免计算节点之间的冗余数据交换来加速算法。该算法是在MapReduce计算框架中设计的,该框架包含三个步骤:数据预处理,本地数据转换和并行数据转换以整合处理结果。并行FFT在16节点云中实现,以处理实际信号数据。实验结果表明,算法速度有了明显的提高。当数据大小达到10 GB时,我们的并行FFT大约比Matlab中的FFT快五倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号