Coding for Parallel Execution of Hardware-in-the-Loop Millimeter Wave Scene Generation Models on Multi-Core SIMD Processor Architectures

机译：在多核SIMD处理器体系结构上并行执行在环硬件毫米波场景生成模型的编码

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Rendering of point scatterer based radar scenes for millimeter wave (mmW) seeker tests in real-time hardware-in-the-loop (HWIL) scene generation requires efficient algorithms and vector-friendly computer architectures for complex signal synthesis. New processor technology from Intel implements an extended 256-bit vector SIMD instruction set (AVX, AVX2) in a multi-core CPU design providing peak execution rates of hundreds of GigaFLOPS (GFLOPS) on one chip. Real world mmW scene generation code can approach peak SIMD execution rates only after careful algorithm and source code design. An effective software design will maintain high computing intensity emphasizing register-to-register SIMD arithmetic operations over data movement between CPU caches or off-chip memories. Engineers at the U.S. Army Aviation and Missile Research, Development and Engineering Center (AMRDEC) applied two basic parallel coding methods to assess new 256-bit SIMD multi-core architectures for mmW scene generation in HWIL. These include use of POSEX threads built on vector library functions and more portable, high-level parallel code based on compiler technology (e.g. OpenMP pragmas and SIMD autovectorization). Since CPU technology is rapidly advancing toward high processor core counts and TeraFLOPS peak SIMD execution rates, it is imperative that coding methods be identified which produce efficient and maintainable parallel code. This paper describes the algorithms used in point scatterer target model rendering, the parallelization of those algorithms, and the execution performance achieved on an AVX multi-core machine using the two basic parallel coding methods. The paper concludes with estimates for scale-up performance on upcoming multi-core technology.

机译：在实时硬件在环（HWIL）场景生成中，渲染基于点散射体的雷达场景以进行毫米波（mmW）导引头测试，需要高效的算法和矢量友好的计算机体系结构来进行复杂的信号合成。英特尔的新处理器技术在多核CPU设计中实现了扩展的256位矢量SIMD指令集（AVX，AVX2），在一个芯片上提供了数百个GigaFLOPS（GFLOPS）的峰值执行率。实际的mmW场景生成代码只有经过仔细的算法和源代码设计，才能达到SIMD的峰值执行率。有效的软件设计将保持较高的计算强度，重点是在CPU高速缓存或片外存储器之间进行数据移动时寄存器到寄存器SIMD算术运算。美国陆军航空与导弹研究，开发和工程中心（AMRDEC）的工程师应用了两种基本的并行编码方法来评估用于HWIL中mmW场景生成的新256位SIMD多核体系结构。其中包括使用基于矢量库函数构建的POSEX线程以及基于编译器技术（例如OpenMP编译指示和SIMD自动矢量化）的更可移植的高级并行代码。由于CPU技术正朝着更高的处理器内核数量和TeraFLOPS SIMD峰值执行率快速发展，因此必须确定能够产生有效且可维护的并行代码的编码方法。本文介绍了点散射体目标模型渲染中使用的算法，这些算法的并行化以及使用两种基本并行编码方法在AVX多核计算机上实现的执行性能。本文以对即将到来的多核技术的放大性能的估计作为结束。

著录项

来源
《Technologies for synthetic environments: hardware-in-the-loop XVIII》|2013年|87070D.1-87070D.11|共11页
会议地点 Baltimore MD(US)
作者
Richard F. Olson Jr.;
展开▼
作者单位

Air and Missile Defense Simulations SSDD, United States Army AMRDEC Redstone Arsenal, Alabama 35898;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Radar Simulation; Doppler Beam Sharpening; Hardware-in-the-Loop; Point Scatterer Modeling; Radar Scene Generation; Vector Signal Processing;

机译：雷达仿真多普勒光束锐化；硬件在环点散射体建模；雷达场景生成；矢量信号处理;

相似文献

外文文献
中文文献
专利

1. Parallelizing and optimizing neural Encoder-Decoder models without padding on multi-core architecture [J] . Yuchen Qiao, Kazuma Hashimoto, Akiko Eriguchi, Future generation computer systems . 2020,第Jula期

机译：并行化和优化神经编码器-解码器模型，而无需在多核体系结构上进行填充
2. Design Methodology of the Heterogeneous Multi-core Processor With the Combination of Parallelized Multi-core Simulator and Common Register File-Based Instruction Set Extension Architecture [J] . Bingbing Xia, Fei Qiao, Huazhong Yang, Journal of Computers . 2013,第2期

机译：异构多核处理器的设计方法，具有并行化多核模拟器和基于公共寄存器文件指令集扩展架构的组合
3. RTOS support for execution of parallelized hard real-time tasks on the MERASA multi-core processor [J] . Julian Wolf, Mike Gerdes, Florian Kluge, International Journal of Computer Systems Science & Engineering . 2011,第6期

机译：RTOS支持在MERASA多核处理器上执行并行的硬实时任务
4. Coding for Parallel Execution of Hardware-in-the-Loop Millimeter Wave Scene Generation Models on Multi-Core SIMD Processor Architectures [C] . Richard F. Olson Jr. Conference on technologies for synthetic environments: hardware-in-the-loop XVIII . 2013

机译：编码多核SIMD处理器架构上的硬件in-Loop毫米波场景生成模型的并行执行编码
5. Exploiting multi-core processors for the service oriented architecture paradigm: Parallel XML processing and concurrent service orchestration. [D] . Lu, Wei. 2009

机译：为面向服务的体系结构范例开发多核处理器：并行XML处理和并发服务编排。
6. A Parallel Architecture for the Partitioning around Medoids (PAM) Algorithm for Scalable Multi-Core Processor Implementation with Applications in Healthcare [O] . Hassan Mushtaq, Sajid Gul Khawaja, Muhammad Usman Akram, 2018

机译：围绕Medoids（PAM）算法进行分区的并行体系结构可实现可扩展的多核处理器及其在医疗保健中的应用
7. Parallelization of KMP String Matching Algorithm on Different SIMD Architectures: Multi-Core and GPGPUapos;s [O] . Akhtar Rasool, Nilay Khare 2012

机译：KMP字符串匹配算法在不同SIMD架构上的并行化：多核和GPGPU

Coding for Parallel Execution of Hardware-in-the-Loop Millimeter Wave Scene Generation Models on Multi-Core SIMD Processor Architectures

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅