...
首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Fast Online Set Intersection for Network Processing on FPGA
【24h】

Fast Online Set Intersection for Network Processing on FPGA

机译:在FPGA上进行网络处理的快速在线集交叉点

获取原文
获取原文并翻译 | 示例

摘要

Online set intersection operations have been widely used in network processing tasks, such as Quality of Service differentiation, firewall processing, and packet/traffic classification. The major challenge for online set intersection is to sustain line-rate processing speed; accelerating set intersection using state-of-the-art hardware devices is of great interest to the research community. In this paper, we present a novel high-performance set intersection approach on FPGA. In our approach, each element in any set is represented by a combination of Group ID (GID) and Bit Stride (BS); all the sets are intersected using linear merge techniques and bitwise AND operations. We map our online set intersection algorithm onto hardware; this is done by constructing modular Processing Element (PE) and concatenating multiple PEs into a tree-based parallel architecture. In order to improve the throughput on a state-of-the-art FPGA, we feed all the inputs to FPGA in a streaming fashion with the help of the synchronization GIDs. Post place-and-route results show that, for a typical set intersection problem in network processing, our design can intersect eight sets, each of up to 32 K elements, at a throughput of 47.4 Thousand Intersections Per Second (KIPS) and a latency of 94.8μ s per batch of inputs. Compared to the classic linear merge or bitwise AND techniques on state-of-the-art multi-core processors, our designs on FPGA achieves up to 66× throughput improvement and 80× latency reduction.
机译:在线集交叉口操作已广泛用于网络处理任务中,例如服务质量区分,防火墙处理和数据包/流量分类。在线集合路口的主要挑战是维持线速处理速度。使用最新的硬件设备来加速集合路口是研究界非常感兴趣的。在本文中,我们提出了一种在FPGA上的新型高性能集合交集方法。在我们的方法中,任何组中的每个元素都由组ID(GID)和位跨度(BS)的组合表示;使用线性合并技术和按位与运算将所有集合相交。我们将在线集合相交算法映射到硬件上;这是通过构造模块化处理元素(PE)并将多个PE连接到基于树的并行体系结构中来完成的。为了提高最新型FPGA的吞吐量,我们借助同步GID以流方式将所有输入馈送到FPGA。布局后的结果表明,对于网络处理中的典型集合相交问题,我们的设计可以相交8个集合,每个集合最多32 K元素,吞吐量为每秒47.4千个交叉点(KIPS)和延迟每批输入94.8μs。与先进的多核处理器上的经典线性合并或按位与技术相比,我们在FPGA上的设计实现了高达66倍的吞吐量提高和80倍的延迟减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号