Fast Approximation of the Top-k Items in Data Streams Using a Reconfigurable Accelerator

机译：使用可重构的加速器快速近似数据流中的顶部K项

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a novel method for finding the top-k items in data streams using a reconfigurable accelerator. The accelerator is capable of extracting an approximate list of the topmost frequently occurring items in an input stream, which is only scanned once without the need for random-access. The accelerator is based on a hardware architecture that implements the well-known Probabilistic sampling algorithm by mapping its main processing stages to two custom systolic arrays. The proposed architecture is the first hardware implementation of this algorithm, which shows better scalability compared to other architectures that are based on other stream algorithms. When implemented on an Intel Arria 10 FPGA (10AX115N2F45E1SG), 50% of the FPGA chip is sufficient for 3000+ Processing Elements (PEs). Experimental results on both synthetic and real input datasets showed very good accuracy and significant throughput gains compared to existing solutions. With achieved throughputs exceeding 300 Million items/s, we report average speedups of 20x compared to typical software implementations, 1.5x compared to GPU-accelerated implementations, and 1.8x compared to the fastest FPGA implementation.

机译：本文介绍了使用可重构的加速器在数据流中查找顶-K项的新方法。加速器能够在输入流中提取最顶层最常见的项目的近似列表，这仅扫描一次，而无需随机接入。加速器基于硬件架构，其通过将其主要处理阶段映射到两个自定义收缩阵列来实现众所周知的概率采样算法。所提出的架构是该算法的第一个硬件实现，其与基于其他流算法的其他架构相比，该算法显示了更好的可伸缩性。当在Intel Arria 10 FPGA（10AX115N2F45E1SG上），50％的FPGA芯片足以3000+处理元件（PE）。与现有解决方案相比，合成和实际输入数据集的实验结果显示出非常好的准确性和显着的吞吐量收益。随着吞吐量超过3亿物品，我们向典型的软件实现相比，与GPU加速的实现相比，将平均速度为20倍，与最快的FPGA实现相比，1.5倍。

著录项

来源
《International Symposium on Applied Reconfigurable Computing》|2021年|3-17|共15页
会议地点
作者
Ali Ebrahim; Jalal Khalifat;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Data stream; Probabilistic sampling; Top-k items; FPGA;

机译：数据流;概率抽样;Top-K项目;FPGA.;
入库时间 2022-08-26 13:58:26

相似文献

外文文献
中文文献
专利

1. A Fast and Efficient Algorithm for Finding Frequent Items over Data Stream [J] . Ling Chen1, Yixin Chen2, Li Tu3 Journal of Computers . 2012,第7期

机译：一种快速有效的算法，用于在数据流中查找频繁的项目
2. Fast and memory efficient mining of high-utility itemsets from data streams: with and without negative item profits [J] . Hua-Fu Li, Hsin-Yun Huang, Suh-Yin Lee Knowledge and information systems . 2011,第3期

机译：快速且内存高效地从数据流中挖掘高功能项集：有或没有负项利润
3. Fast and memory efficient mining of high-utility itemsets from data streams: with and without negative item profits [J] . Hua-Fu Li, Hsin-Yun Huang, Suh-Yin Lee Knowledge and Information Systems . 2011,第3期

机译：快速且内存有效地从数据流中挖掘高功能项集：有或没有负项利润
4. SSS: An Accurate and Fast Algorithm for Finding Top-k Hot Items in Data Streams [C] . Junzhi Gong, Deyu Tian, Dongsheng Yang, IEEE International Conference on Big Data and Smart Computing . 2018

机译：SSS：一种用于查找数据流中前k个热门项目的准确且快速的算法
5. Ad-hoc top-k query answering for data streams. [D] . Sarkas, Nikolaos. 2007

机译：数据流的临时top-k查询应答。
6. Streaming chunk incremental learning for class-wise data stream classification with fast learning speed and low structural complexity [O] . Prem Junsawang, Suphakant Phimoltares, Chidchanok Lursinsap 2012

机译：流式块增量学习，用于以快速的学习速度和较低的结构复杂度对类数据流进行分类
7. Mining top-K frequent items in a data stream with flexible sliding windows [O] . Hoang TL Thanh Lam, Calders TGK Toon 2010

机译：使用灵活的滑动窗口挖掘数据流中的前K个频繁项

Fast Approximation of the Top-k Items in Data Streams Using a Reconfigurable Accelerator

摘要

著录项

相似文献

相关主题

期刊订阅