A Parallel Fast Fourier Transform Algorithm for Large-Scale Signal Data Using Apache Spark in Cloud

Abstract

In signal processing, the Fast Fourier Transform (FFT) is a widely used algorithm for transforming signal data from the time domain to the frequency domain. With the exponential growth of data, however, traditional methods cannot meet the demands of large-scale computation because of three main challenges of large-scale FFT: big data size, real-time data processing, and high utilization of compute resources. To satisfy these requirements, an optimized FFT algorithm in the Cloud is badly needed. In this paper, we introduce a new method to conduct FFT in the Cloud with the following contributions: first, we design a parallel FFT algorithm for large-scale signal data in the Cloud; second, we propose a MapReduce-based mechanism to distribute data to compute nodes using a big data processing framework; third, we implement an optimal method of distributing compute resources that accelerates the algorithm by avoiding redundant data exchange between compute nodes. The algorithm is designed in the MapReduce computation framework and contains three steps: data preprocessing, local data transform, and parallel data transform to integrate processing results. The parallel FFT is implemented in a 16-node Cloud to process real signal data. The experimental results reveal an obvious improvement in the algorithm speed: our parallel FFT is approximately five times faster than FFT in Matlab when the data size reaches 10 GB.
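
The three steps named in the abstract (preprocessing, local transform, parallel integration) can be illustrated with a small single-machine sketch. This is not the paper's implementation: it uses plain Python lists where the paper uses Spark, the function names `dft` and `partitioned_fft` are hypothetical, and a naive DFT stands in for whatever local FFT routine a compute node would run. It assumes the standard decimation-in-time identity X[k] = sum over r of W_N^(r*k) * F_r[k mod m], where F_r is the transform of the r-th interleaved partition and m is the partition length.

```python
import cmath


def dft(x):
    """Naive O(n^2) DFT; a stand-in for any local FFT routine."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def partitioned_fft(signal, num_parts):
    """Compute the DFT of `signal` by splitting it into `num_parts` partitions."""
    n = len(signal)
    if n % num_parts != 0:
        raise ValueError("signal length must be divisible by num_parts")
    m = n // num_parts  # samples per partition

    # Step 1 (preprocessing): deal samples round-robin so that
    # partition r holds x[r], x[r + P], x[r + 2P], ...
    parts = [signal[r::num_parts] for r in range(num_parts)]

    # Step 2 (local transform): each partition is transformed independently;
    # in a cluster this is the part that needs no data exchange.
    local = [dft(p) for p in parts]

    # Step 3 (integration): combine the local spectra with twiddle factors
    # W_N^(r*k) = exp(-2*pi*i*r*k / N).
    return [
        sum(
            cmath.exp(-2j * cmath.pi * r * k / n) * local[r][k % m]
            for r in range(num_parts)
        )
        for k in range(n)
    ]


if __name__ == "__main__":
    samples = [complex(i % 5, 0) for i in range(12)]
    direct = dft(samples)
    split = partitioned_fft(samples, 3)
    # Largest deviation between the two results (floating-point noise only).
    print(max(abs(a - b) for a, b in zip(direct, split)))
```

In this sketch only step 3 needs values from every partition, which is why the amount of data exchanged between nodes during integration matters for a distributed version.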

Bibliographic details

Source: International Conference on Algorithms and Architectures for Parallel Processing, 2018, pp. 293-310 (18 pages)
Authors: Cheng Yang; Weidong Bao; Xiaomin Zhu; Ji Wang; Wenhua Xiao
Original format: PDF
Keywords: Fast Fourier transform; Cloud computing; Apache Spark; Parallel algorithm

Similar literature

1. Application of parallel version two-dimensional fast Fourier transform algorithm, analog of the Cooley-Tukey algorithm, for digital image processing of satellite data [J]. Mikhail Noskov, Valeriy Tutatchikov, Mikhail Lapchik. E3S Web of Conferences. 2019, No. 4
2. Large-Scale Parallel Implementation of Hartree-Fock Exchange Energy on Real-Space Grids Using 3D-Parallel Fast Fourier Transform [J]. Takahashi Hideaki, Sakuraba Shun, Morita Akihiro. Journal of Chemical Information and Modeling. 2020, No. 3
3. A fast parallel attribute reduction algorithm using Apache Spark [J]. Yin Linzi, Qin Liyang, Jiang Zhaohui. Knowledge-Based Systems. 2021
4. A Parallel Fast Fourier Transform Algorithm for Large-Scale Signal Data Using Apache Spark in Cloud [C]. Cheng Yang, Weidong Bao, Xiaomin Zhu. International Conference on Algorithms and Architectures for Parallel Processing. 2018
5. Fast Fourier transform for option pricing: Improved mathematical modeling and design of an efficient parallel algorithm [D]. Barua, Sajib. 2004
6. Large-scale virtual screening on public cloud resources with Apache Spark [O]. Marco Capuccini, Laeeq Ahmed, Wesley Schaal. 2017
7. Parallel Zero-Copy Algorithms for Fast Fourier Transform and Conjugate Gradient using MPI Datatypes [O]. Torsten Hoefler, Steven Gottlieb. 2010
8. Parallel fast Fourier transforms for non power of two data [R]. Semeraro, B. D. 1994
