International Conference on Electrical, Electronics, Communication, Computer, and Optimization Techniques

Performance Comparison of Serial and Parallel Multipliers in Massively Parallel Environment


Abstract

The computational environment of Deep Learning Neural Networks (DLNNs) differs considerably from that of conventional computer systems. DLNNs require thousands, if not millions, of compute cores, compared to one or a few in conventional systems. There is therefore a need to revisit performance issues to gain a better understanding of how systems behave in such massively parallel architectures. Precision, speed, memory access, bus contention, resource sharing, and chip area are some of the key issues that need to be studied in this changed context. Low-precision multiplication remains one of the most commonly used operations in neural computation. This paper draws the reader's attention to some interesting results in area-speed tradeoffs when applied to massively parallel architectures. A new low-precision fixed-point representation is discussed. A hardware accelerator and its software components used in the simulation are briefly described. Results show that serial multipliers can perform better than parallel multipliers when throughput per unit area of silicon is considered.
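To make the serial-versus-parallel contrast concrete, the sketch below models the behavior of a bit-serial (shift-and-add) multiplier, which accumulates one partial product per clock cycle using a single adder, whereas a parallel (array) multiplier generates all partial products at once at a much larger silicon cost. This is a minimal illustrative model, not the paper's accelerator; the function names and the 8-bit width are assumptions.

```python
def serial_multiply(a: int, b: int, width: int = 8) -> int:
    """Model of an unsigned bit-serial shift-and-add multiplier.

    One bit of the multiplier `b` is examined per simulated clock
    cycle, so an n-bit multiply takes n cycles but needs only a
    single adder (small area).
    """
    mask = (1 << width) - 1
    a &= mask
    b &= mask
    product = 0
    for cycle in range(width):        # one partial product per cycle
        if (b >> cycle) & 1:          # LSB-first scan of the multiplier
            product += a << cycle     # add the shifted multiplicand
    return product


# A parallel array multiplier produces the same result in one cycle,
# but uses on the order of n*n adder cells; the paper's argument is
# about throughput per unit area, not raw latency.
print(serial_multiply(13, 11))        # 13 * 11 = 143
```

Under this simple model, n serial multipliers occupying roughly the area of one array multiplier can each start an independent multiply every n cycles, which is the kind of area-normalized throughput comparison the abstract refers to.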
