FPGA Based Co-design of Storage-side Query Filter for Big Data Systems

机译：基于FPGA的大数据量协同查询系统设计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we are interested in accelerating the processing of big data systems. We consider the architecture of storage and computing separated Big Data systems, and approach to improve the data query efficiency in the storage side. We propose an Field Programmable Gate Array (FPGA) based co-design of query filter on storage nodes to reduce the workloads of computing nodes and the communication overheads between them. The codesign of query filter is composed of software layer and FPGA layer. In software layer, we use the pointers to project the data in the RCFile format to reduce data transmission, and then formulate the combined predicate of SQL conditions into parameters. In FPGA layer, we design two filtering schemes on FPGA for data in RCFile format, i.e. parallel sequential filter and parallel pipeline filter, by which we can achieve that different columns and SQL queries are completely parallel. Based on TPC-H benchmark and Tencent data set, we conduct extensive experiments to evaluate our design, which can save averagely 76.2% of time overhead compared with Presto and 96.86% of time overhead compared with Hive.

机译：在本文中，我们感兴趣的是加速大数据系统的处理。我们考虑了存储和计算分离的大数据系统的体系结构，并在存储方面提高了数据查询效率。我们提出了一种基于现场可编程门阵列（FPGA）的存储节点查询滤波器协同设计方案，以减少计算节点的工作量和它们之间的通信开销。查询滤波器的协同设计由软件层和FPGA层组成。在软件层，我们使用指针将数据以RCFile格式投影，以减少数据传输，然后将SQL条件的组合谓词表示为参数。在FPGA层，我们在FPGA上对RCFile格式的数据设计了两种滤波方案，即并行顺序滤波和并行流水线滤波，实现了不同列和SQL查询的完全并行。基于TPC-H benchmark和腾讯数据集，我们进行了大量实验来评估我们的设计，与Presto相比，平均节省76.2%的时间开销，与Hive相比，平均节省96.86%的时间开销。

著录项

来源
《IEEE International System on Chip Conference》|2020年|25-30|共6页
会议地点
作者
Jinyu Zhan; Ying Li; Wei Jiang; Jianping Zhu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Filtering; Conferences; Pipelines; Big Data; Benchmark testing; Logic gates; Software;

机译：过滤;会议;管道;大数据;基准测试;逻辑门;软件;

相似文献

外文文献
中文文献
专利

1. SQLS: A Storm-Based Query Language System for Real-Time Stream Data Analysis [J] . JI Yimu, ZHANG Dianchao, SUN Yanfei, 电子学报（英文版） . 2016,第006期
2. SQLS：A Storm-Based Query Language System for Real-Time Stream Data Analysis [J] . JI Yimu1234, ZHANG Dianchao1, SUN Yanfei5, 电子学报：英文版 . 2016,第006期
3. AbIx: An Approach to Content-Based Approximate Query Processing in Peer-to-Peer Data Systems [J] . Chao-Kun Wang, Jian-Min Wang, Jia-Guang Sun, 计算机科学技术学报（英文版） . 2007,第002期
4. Geographic information system query optimisation algorithm based on redundant data deletion and filtering technology [J] . Chunyang Lu, Feng Wen International journal of internet protocol technology . 2018,第4期

机译：基于冗余数据删除和过滤技术的地理信息系统查询优化算法
5. An efficient image retrieval system with structured query based feature selection and filtering initial level relevant images using range query [J] . Annrose J., Christopher C. Seldev Optik: Zeitschrift fur Licht- und Elektronenoptik: = Journal for Light-and Electronoptic . 2018,第期

机译：一种有效的图像检索系统，具有基于结构化查询的特征选择和使用范围查询过滤初始级别相关图像
6. Comparing a knowledge-based and a data-driven method in querying data streams for system fault detection: A hydraulic drive system application [J] . Ahmad Alzghoul, Bjorn Backe, Magnus Lofstrand Computers in Industry . 2014,第8期

机译：比较基于知识和数据驱动的方法来查询数据流以进行系统故障检测：液压驱动系统应用
7. A co-design approach for accelerated SQL query processing via FPGA-based data filtering [C] . Andreas Becher, Daniel Ziener, Klaus Meyer-Wegener, 2015 International Conference on Field Programmable Technology . 2015

机译：通过基于FPGA的数据过滤加快SQL查询处理的协同设计方法
8. Design and implementation of an FPGA-based piecewise affine Kalman Filter for Cyber-Physical Systems. [D] . Mills, Aaron Joseph. 2016

机译：电子物理系统基于FPGA的分段仿射卡尔曼滤波器的设计与实现。
9. IV. Clinical Consultation Systems Medical Decision Support Systems and Clinical Research Data Bases: B. Clinical Research Data Bases: Medical Query Language [O] . Mary M. Morgan, Peter D. Beaman, Daniel J. Shusman, 1981

机译：IV。临床咨询系统医疗决策支持系统和临床研究数据库：B.临床研究数据库：医学查询语言
10. FPGA-based efficient hardware/software co-design for industrial systems with systematic sensor selection [O] . Deliparaschos, Kyriakos, Michail, Konstantinos, Zolotas, Argyrios, 2016

机译：基于FPGA的高效的硬件/软件协同设计，适用于具有系统传感器选择的工业系统
11. Graphic Interface for Attribute-Based Data Language Queries from a Personal Computer to the Multi-Lingual, Multi-Model, Multi-Backend Database System over an Ethernet Network. [R] . Sympson, W. G. 1989

机译：基于属性的数据语言查询的图形界面，从个人计算机到以太网上的多语言，多模型，多后端数据库系统。

FPGA Based Co-design of Storage-side Query Filter for Big Data Systems

摘要

著录项

相似文献

相关主题

期刊订阅