...
首页> 外文期刊>IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems >An Extended Nonstrict Partially Ordered Set-Based Configurable Linear Sorter on FPGAs
【24h】

An Extended Nonstrict Partially Ordered Set-Based Configurable Linear Sorter on FPGAs

机译:在FPGA上扩展非延长非排序的基于部分有序的可配置线性分拣机

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Sorting is essential for many scientific and data processing problems. It is significant to improve the efficiency of sorting. Taking advantage of specialized hardware, parallel sorting, e.g., sorting networks and linear sorters, implements sorting in lower time complexity. However, most of them are designed based on the parallelization of algorithms, lacking consideration of specialized hardware structures. In this article, we propose an extended nonstrict partially ordered set-based configurable linear sorter on field-programmable gate arrays (FPGAs). First, we extend nonstrict partial order to the binary tuple and n-tuple nonstrict partial orders. Then, the linear sorting algorithm is defined based on them, with the consideration of hardware performance. It has 4N time complexity varying from 4 to 2 N as the tuple size varies. The number of comparisons reduces to N/2 in binary tuple-based sorting, which is half of the state-of-the-art insertion linear sorting. Finally, we implement the linear sorter on FPGAs. It consists of multiple customizable micro-cores, named sorting units (SUs). The SU packages the storage and comparison of the tuple. All the SUs are connected into a chain with simple communication, which makes the sorter fully configurable in length, bandwidth, and throughput. They also act the same in each clock cycle, so that the achieved frequency of the sorter improves. In our experiment, the sorter achieves at most 660-MHz frequency, 5.6 Gb/s throughput, and 87 times speed-up compared with the quick sort algorithm on general processors.
机译:排序对于许多科学和数据处理问题至关重要。提高分类效率是很重要的。利用专用硬件,并行排序,例如排序网络和线性分拣机,实现在较低时间复杂度的排序。然而,大多数是基于算法的并行化,缺乏考虑专业硬件结构的设计。在本文中,我们在现场可编程门阵列(FPGA)上提出了扩展的非特动部分有序的可配置线性分拣机。首先,我们向二进制元组和N组非推动不定部分命令扩展非临时题目。然后,考虑到硬件性能,基于它们定义了线性排序算法。由于元组大小变化,它有4n / n的复杂性不同于4到2 n。比较次数减少到基于二进制元组的排序中的N / 2,这是最先进的插入线性排序的一半。最后,我们在FPGA上实现了线性分拣机。它包括多种可自定义的微核,命名为排序单元(SUS)。 SU包包的存储和比较元组。所有SUS都连接到具有简单通信的链条中,这使得分拣机完全可配置长度,带宽和吞吐量。它们在每个时钟周期中也相同,因此实现了分拣机的频率改善。在我们的实验中,与一般处理器的快速排序算法相比,分拣机以最多660 MHz频率,5.6 GB / s吞吐量和87倍加速87倍。

著录项

  • 来源
  • 作者单位

    Jilin Univ Coll Comp Sci & Technol Changchun 13002 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Dept Comp Sci & Technol Zhuhai Lab Zhuhai Coll Zhuhai 519041 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Changchun 13002 Peoples R China;

    Jilin Univ Coll Comp Sci & Technol Changchun 13002 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Dept Comp Sci & Technol Zhuhai Lab Zhuhai Coll Zhuhai 519041 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Changchun 13002 Peoples R China;

    Jilin Univ Coll Comp Sci & Technol Changchun 13002 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Dept Comp Sci & Technol Zhuhai Lab Zhuhai Coll Zhuhai 519041 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Changchun 13002 Peoples R China;

    Jilin Univ Coll Comp Sci & Technol Changchun 13002 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Dept Comp Sci & Technol Zhuhai Lab Zhuhai Coll Zhuhai 519041 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Changchun 13002 Peoples R China;

    Univ Minho Ctr Algoritmi Guimaraes 4800058 Portugal;

    Jilin Univ Coll Comp Sci & Technol Changchun 13002 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Dept Comp Sci & Technol Zhuhai Lab Zhuhai Coll Zhuhai 519041 Peoples R China|Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Changchun 13002 Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Sorting; Field programmable gate arrays; Time complexity; Throughput; Bandwidth; Program processors; Hardware; Extended partial order; field-programmable gate arrays (FPGAs); linear sorter; tuple;

    机译:排序;现场可编程门阵列;时间复杂性;吞吐量;带宽;程序处理器;硬件;延长部分顺序;现场可编程门阵列(FPGA);线性分拣机;元组;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号