Scaling the Construction of Wavelet Synopses for Maximum Error Metrics

Mytilinis Ioannis; Tsoumakos Dimitrios; Koziris Nectarios

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Scaling the Construction of Wavelet Synopses for Maximum Error Metrics

【24h】

Scaling the Construction of Wavelet Synopses for Maximum Error Metrics

机译：扩展小波概要的构造以获取最大误差指标

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Modern analytics involve computations over enormous numbers of data records. The volume of data and the stringent response-time requirements place increasing emphasis on the efficiency of approximate query processing. A major challenge over the past years has been the construction of synopses that provide a deterministic quality guarantee, often expressed in terms of a maximum error metric. By approximating sharp discontinuities, wavelet decomposition has proved to be a very effective tool for data reduction. However, existing wavelet thresholding schemes that minimize maximum error metrics are constrained with impractical complexities for large datasets. Furthermore, they cannot efficiently handle the multi-dimensional version of the problem. In order to provide a practical solution, we develop parallel algorithms that take advantage of key-properties of the wavelet decomposition and allocate tasks to multiple workers. To that end, we present (i) a general framework for the parallelization of existing dynamic programming algorithms, (ii) a parallel version of one such DP algorithm, and (iii) two highly efficient distributed greedy algorithms that can deal with data of arbitrary dimensionality. Our extensive experiments on both real and synthetic datasets over Hadoop show that the proposed algorithms achieve linear scalability and superior running-time performance compared to their centralized counterparts.

机译：现代分析涉及对大量数据记录的计算。数据量和严格的响应时间要求越来越强调近似查询处理的效率。过去几年中的主要挑战是提要的构建，这些提要提供确定性的质量保证，通常以最大误差度量表示。通过逼近尖锐的不连续点，小波分解已被证明是用于数据缩减的非常有效的工具。但是，现有的将最大误差度量最小化的小波阈值方案受到大型数据集不切实际的复杂性的约束。此外，他们无法有效处理问题的多维版本。为了提供一个实用的解决方案，我们开发了并行算法，这些算法利用了小波分解的关键属性，并将任务分配给多个工作人员。为此，我们提出了（i）现有动态编程算法并行化的通用框架，（ii）一种此类DP算法的并行版本，以及（iii）可以处理任意数据的两种高效分布式贪婪算法维度。我们在Hadoop上对真实和合成数据集进行的广泛实验表明，与集中式算法相比，所提出的算法可实现线性可扩展性和出色的运行时性能。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2019年第9期|1794-1808|共15页
作者
Mytilinis Ioannis; Tsoumakos Dimitrios; Koziris Nectarios;
展开▼
作者单位

NTUA Dept Elect & Comp Engn Zografos 15780 Greece;

Ionian Univ Kerkira 49100 Greece;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Approximate query processing; wavelet synopses; hadoop; distributed runtimes; maximum error metrics;

机译：近似查询处理;小波提要Hadoop分布式运行时;最大错误指标;

相似文献

外文文献
中文文献
专利

1. Wavelet Synopses for General Error Metrics [J] . MINOS GAROFALAKIS, AMIT KUMAR ACM transactions on database systems . 2005,第4期

机译：一般误差度量的小波提要
2. Two-dimensional wavelet synopses with maximum error bound and its application in parallel compression [J] . Li Xiaoyun, Fan Ruiqin, Zhang Hao Lan, Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2019,第3aPta1期

机译：具有最大误差绑定的二维小波概要及其在并行压缩中的应用
3. Computing Unrestricted Synopses Under Maximum Error Bound [J] . Chaoyi Pang, Qing Zhang, Xiaofang Zhou, Algorithmica . 2013,第1期

机译：在最大误差范围内计算不受限制的概要
4. Image Compression Based on Restricted Wavelet Synopses with Maximum Error Bound [C] . Xiaoyun Li, Shizhong Huang, Huanyu Zhao, IEEE/ACM International Conference on Utility and Cloud Computing . 2016

机译：基于最大误差界的受限小波提要的图像压缩
5. Construction of orthogonal compactly-supported scaling functions and multiwavelets on arbitrary meshes. [D] . Kessler, Walter Bruce. 1997

机译：在任意网格上构造正交紧致支持的缩放函数和多小波。
6. WAVELET-BASED BAYESIAN ESTIMATION OF PARTIALLY LINEAR REGRESSION MODELSWITH LONG MEMORY ERRORS [O] . Kyungduk Ko, Leming Qu, Marina Vannucci -1

机译：基于小记忆误差的部分线性回归模型的小波贝叶斯估计
7. Unrestricted Wavelet Synopses under Maximum Error Bound [O] . Chaoyi Pang, Qing Zhang, David Hansen, 2014

机译：最大误差界限下的无限制小波概要

Scaling the Construction of Wavelet Synopses for Maximum Error Metrics

摘要

著录项

相似文献

相关主题

期刊订阅