首页> 外文OA文献 >An Implementation of a Three Dimensional Computational Pipeline with Minimal Latency and Maximum Throughput for LU Factorization Using Field Programmable Gate Arrays
【2h】

An Implementation of a Three Dimensional Computational Pipeline with Minimal Latency and Maximum Throughput for LU Factorization Using Field Programmable Gate Arrays

机译:使用现场可编程门阵列实现LU分解的最小延迟和最大吞吐量的三维计算管道的实现

摘要

Traditionally, computationally intense algebraic functions such as LU factorization are solved using complex systems such as supercomputers, parallel processing systems, and non-dedicated computing clusters. While these solutions are adequate for some problems, they typically suffer from classic parallel processing issues such as communication overhead, complex scheduling algorithms, and cost. Moreover, they are not feasible for embedded applications. Extremely high performance solutions are sometimes implemented using costly, custom hardware such as Application Specific Integrated Circuits (ASICs). Unfortunately, the design, implementation, and verification of ASICs has become cost prohibitive and such solutions are only feasible if the end design is to be manufactured in very high volumes. As a result, many proposed architectures to solve specific problems lie dormant because they are simply too expensive to realize.In recent years, advancements in Field Programmable Gate Array (FPGA) technology allow engineers to map complex algorithms to logic gates while achieving performance similar to ASIC technology. This thesis demonstrates the feasibility of the implementation of a three dimensional pipeline designed to solve LU factorization using FPGAs based on an architecture proposed nearly 10 years ago when a technology to implement such an architecture either did not exist or was too costly to implement.
机译:传统上,使用复杂的系统(例如超级计算机,并行处理系统和非专用计算集群)来解决计算强度大的代数函数(例如LU分解)。尽管这些解决方案足以解决某些问题,但它们通常会遇到经典的并行处理问题,例如通信开销,复杂的调度算法和成本。而且,它们对于嵌入式应用程序是不可行的。有时会使用昂贵的定制硬件(例如专用集成电路(ASIC))来实现极高性能的解决方案。不幸的是,ASIC的设计,实现和验证已变得成本过高,而这种解决方案只有在最终设计要大量生产时才可行。结果,许多解决特定问题的提议架构处于休眠状态,因为它们太昂贵了,无法实现。近年来,现场可编程门阵列(FPGA)技术的进步使工程师能够将复杂算法映射到逻辑门,同时获得与ASIC技术。本文证明了设计基于FPGA的三维流水线的可行性,该流水线基于基于近10年前提出的架构的FPGA来解决LU分解,当时不存在实现这种架构的技术或实施成本太高。

著录项

  • 作者

    Henciak Edward Thomas;

  • 作者单位
  • 年度 2008
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号