Parallel POD Compression of Time-Varying Big Datasets Using m-Swap on the K Computer

Abstract

Thanks to supercomputers, increasingly complicated simulations can now be carried out successfully. Analyzing and understanding the intrinsic properties of the big datasets these simulations produce has therefore become an urgent research task for scientists. However, the explosive size of the datasets makes this task difficult. Reducing the size of big datasets has thus become an important topic, in which data compression and parallel computing are the two key techniques. In this paper, we present a parallel data compression approach to reduce the size of time-varying big datasets. First, we employ the proper orthogonal decomposition (POD) method for compression. The POD method extracts the underlying features of a dataset and can thereby greatly reduce its size. Moreover, the compressed datasets can be decompressed linearly, which helps scientists visualize big datasets interactively for analysis. Second, we introduce a novel m-swap method to parallelize the POD compression algorithm effectively. The m-swap method achieves high performance by fully using all parallel computing processors; in other words, no processor is idle during the parallel compression process. Furthermore, the m-swap method can greatly reduce the cost of interprocessor communication. This is achieved by controlling the data transfer among 2m processors to obtain the best balance of computation cost across these processors. Finally, the effectiveness of our method is demonstrated by compressing several time-varying big datasets on the K computer with tens of thousands of processors.
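The abstract gives no implementation details, so the following is only a minimal serial sketch of the general idea of POD compression, assuming the time steps are stored as columns of a snapshot matrix and the POD modes are obtained from a truncated SVD. The function names `pod_compress` and `pod_decompress`, the synthetic data, and the choice of rank are all hypothetical and are not taken from the paper; the m-swap parallelization is not shown.

```python
import numpy as np


def pod_compress(snapshots, rank):
    """Compress a (n_points, n_steps) snapshot matrix to `rank` POD modes.

    Hypothetical helper: a plain truncated SVD, not the paper's parallel method.
    """
    # Remove the temporal mean so the modes describe fluctuations around it.
    mean = snapshots.mean(axis=1, keepdims=True)
    # Thin SVD; the columns of u are the spatial POD modes.
    u, s, vt = np.linalg.svd(snapshots - mean, full_matrices=False)
    modes = u[:, :rank]                      # (n_points, rank)
    coeffs = s[:rank, None] * vt[:rank, :]   # (rank, n_steps) time coefficients
    return mean, modes, coeffs


def pod_decompress(mean, modes, coeffs):
    """Rebuild the snapshots as a linear combination of the stored modes."""
    return mean + modes @ coeffs


# Synthetic time-varying field: two spatial patterns modulated in time.
x = np.linspace(0.0, 1.0, 200)[:, None]   # 200 spatial points
t = np.linspace(0.0, 1.0, 50)[None, :]    # 50 time steps
data = (np.sin(2 * np.pi * x) * np.cos(2 * np.pi * t)
        + 0.5 * np.cos(4 * np.pi * x) * np.sin(6 * np.pi * t))

mean, modes, coeffs = pod_compress(data, rank=2)
restored = pod_decompress(mean, modes, coeffs)

# Number of stored values versus the original, and the relative error.
stored = mean.size + modes.size + coeffs.size
rel_error = np.linalg.norm(data - restored) / np.linalg.norm(data)
print(f"stored {stored} of {data.size} values, relative error {rel_error:.2e}")
```

In this toy case the field is built from two separable patterns, so two modes reproduce it to floating-point precision while storing 700 values instead of 10,000. Decompression is a single matrix product, which is one way to read the abstract's remark that compressed data can be decompressed linearly. Real simulation data would generally need more modes and would incur a nonzero truncation error.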

Bibliographic details

Source
IEEE International Congress on Big Data | 2014 | pp. 438-445 | 8 pages
Authors
Bi Chongke; Ono Kenji; Yang Lu
Original format: PDF
Keywords
Compression algorithms; Image coding; Parallel algorithms; Program processors; Three-dimensional displays; Vectors; POD; m-swap; parallel compression

Similar literature

1. ASTRAL-MP: scaling ASTRAL to very large datasets using randomization and parallelization [J]. Yin John, Zhang Chao, Mirarab Siavash. Bioinformatics, 2019, No. 20
2. Corrigendum to 'Co-processing heterogeneous parallel index for multi-dimensional datasets' [J. Parallel Distrib. Comput. 113 (2018) 195-203] [J]. Jinwoong Kim, Beomseok Nam. Journal of Parallel and Distributed Computing, 2018
3. Analysis of Parallel Algorithms on SMP Node and Cluster of Workstations Using Parallel Programming Models with New Tile-based Method for Large Biological Datasets [J]. D. D. Shrimankar, S. R. Sathe. Bioinformatics and Biology Insights, 2016, Suppl. 2
4. Parallel POD Compression of Time-Varying Big Datasets Using m-Swap on the K Computer [C]. Bi Chongke, Ono Kenji, Yang Lu. IEEE International Congress on Big Data, 2014
5. Optimizing DCT-Based Lossy Compression for Scientific Datasets [D]. Chen, Jiaxi. 2020
6. Compression of Large genomic datasets using COMRAD on Parallel Computing Platform [O]. Christopher Leela Biji, Manu K Madhu, Vineetha Vishnu. 2015
7. Double Tree Wavelet Image Compression On Parallel Mimd Computers [O]. A. Uhl, A. Bruckmann. 2007
