首页> 外文学位 >Mutual algorithm-architecture analysis for real-time parallel systems in particle physics experiments.
【24h】

Mutual algorithm-architecture analysis for real-time parallel systems in particle physics experiments.

机译:粒子物理实验中实时并行系统的相互算法-架构分析。

获取原文
获取原文并翻译 | 示例

摘要

Data acquisition from particle colliders requires real-time detection of tracks and energy clusters from collision events occurring at intervals of tens of {dollar}mu{dollar}s. Beginning with the specification of a benchmark track-finding algorithm, parallel implementations have been developed. A revision of the routing scheme for performing reductions such as a tree sum, called the reduced routing distance scheme, has been developed and analyzed. The scheme reduces inter-PE communication time for narrow communication channel systems. A new parallel algorithm, called the interleaved tree sum, for parallel reduction problems has been developed that increases efficiency of processor use. Detailed analysis of this algorithm with different routing schemes is presented. Comparable parallel algorithms are analyzed, also taking into account the architectural parameters that play an important role in this parallel algorithm analysis. Computation and communication times are analyzed to guide the design of a custom system based on a massively parallel processing component. Developing an optimal system requires mutual analysis of algorithm and architecture parameters. It is shown that matching a processor array size to the parallelism of the problem does not always produce the best system design. Based on promising benchmark simulation results, an application specific hardware prototype board, called Dasher, has been built using two Blitzen chips. The processing array is a mesh-connected SIMD system with 256 PEs. Its design is discussed, with details on the software environment.
机译:从粒子对撞机获取数据需要实时检测轨迹和能量簇,这些轨迹和能量簇的间隔为几十美元。从基准跟踪算法的规范开始,已经开发了并行实现。已经开发并分析了用于执行诸如树之类的归约的路由选择方案的修订版,称为缩减路由选择距离方案。该方案减少了窄通信信道系统的PE间通信时间。已经开发出一种新的并行算法,称为交错树总和,用于并行减少问题,可以提高处理器的使用效率。给出了该算法在不同路由方案下的详细分析。分析了可比的并行算法,同时考虑了在此并行算法分析中起重要作用的体系结构参数。分析计算和通信时间以指导基于大规模并行处理组件的定制系统的设计。开发最佳系统需要相互分析算法和体系结构参数。结果表明,将处理器阵列的大小与问题的并行性相匹配并不总是能够产生最佳的系统设计。基于有希望的基准仿真结果,使用两个Blitzen芯片构建了名为Dasher的专用硬件原型板。该处理阵列是具有256个PE的网状连接SIMD系统。讨论了其设计,并详细介绍了软件环境。

著录项

  • 作者

    Ni, Ping.;

  • 作者单位

    North Carolina State University.;

  • 授予单位 North Carolina State University.;
  • 学科 Computer Science.; Physics Elementary Particles and High Energy.
  • 学位 Ph.D.
  • 年度 1994
  • 页码 138 p.
  • 总页数 138
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;高能物理学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号