首页> 外文期刊>IEEE Transactions on Computers >Scalable hardware priority queue architectures for high-speed packet switches
【24h】

Scalable hardware priority queue architectures for high-speed packet switches

机译:用于高速数据包交换机的可扩展硬件优先级队列体系结构

获取原文
获取原文并翻译 | 示例
           

摘要

With effective packet-scheduling mechanisms, modern integrated networks can support the diverse quality-of-service requirements of emerging applications. However, arbitrating between a large number of small packets on a high-speed link requires an efficient hardware implementation of a priority queue. To highlight the challenges of building scalable priority queue architectures, this paper includes a detailed comparison of four existing approaches: a binary tree of comparators, priority encoder with multiple first-in-first-out lists, shift register, and systolic array. Based on these comparison results, we propose two new architectures that scale to the large number of packets (N) and large number of priority levels (P) necessary in modern switch designs. The first architecture combines the faster clock speed of a systolic array with the lower memory requirements of a shift register, resulting in a hybrid design; a tunable parameter allows switch designers to carefully balance the trade-off between bus loading and chip area. We then extend this architecture to serve multiple output ports in a shared-memory switch. This significantly decreases complexity over the traditional approach of dedicating a separate priority queue to each outgoing link. Using the Verilog hardware description language and the Epoch silicon compiler, we have designed and simulated these two new architectures, as well as the four existing approaches. The simulation experiments compare the designs across a range of priority queue sizes and performance metrics, including enqueue/dequeue speed, chip area, and number of transistors.
机译:借助有效的数据包调度机制,现代集成网络可以支持新兴应用程序的多种服务质量要求。但是,在高速链路上的大量小数据包之间进行仲裁需要优先级队列的高效硬件实现。为了突出构建可伸缩优先级队列体系结构的挑战,本文包括对四种现有方法的详细比较:比较器的二进制树,具有多个先进先出列表的优先级编码器,移位寄存器和脉动阵列。根据这些比较结果,我们提出了两种新的体系结构,它们可以适应现代交换机设计中所需的大量数据包(N)和大量优先级(P)。第一种架构将脉动阵列的更快时钟速度与移位寄存器的较低内存需求结合在一起,从而实现了混合设计。可调参数使开关设计人员可以仔细平衡总线负载和芯片面积之间的权衡。然后,我们扩展此体系结构以在共享内存交换机中为多个输出端口提供服务。与将单独的优先级队列专用于每个传出链路的传统方法相比,这大大降低了复杂性。使用Verilog硬件描述语言和Epoch芯片编译器,我们设计并模拟了这两种新架构以及四种现有方法。仿真实验在一系列优先队列大小和性能指标(包括入队/出队速度,芯片面积和晶体管数量)之间对设计进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号