This paper presents efficient mappings of large sparse neural networks onto a distributed-memory MIMD multicomputer with high-performance vector units. We develop parallel vector code for an idealized network and analyze its performance. Our algorithms combine high performance with a reasonable memory requirement. Because scatter/gather operations are costly, generating high-performance parallel vector code requires careful attention to the details of the representation. We show that vectorization can nevertheless more than quadruple the performance on our modeled supercomputer. Pushing several patterns at a time through the network (batch mode) exposes an extra degree of parallelism, which allows us to improve the performance by an additional factor of 4. Together, vectorization and batch updating therefore yield an order-of-magnitude performance improvement.
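The batch-mode idea described above can be illustrated with a minimal sketch (not the paper's code, and independent of any particular vector hardware): with a sparse weight matrix, each nonzero entry forces an indexed gather from the input; pushing a batch of patterns through at once lets a single index pair serve a whole contiguous row of values, exposing the extra dimension of parallelism that a vector unit can exploit. All names below (`forward_single`, `forward_batch`, the sizes) are illustrative assumptions.

```python
# Hedged sketch: batch mode amortizes sparse indexing over many patterns.
import numpy as np

rng = np.random.default_rng(0)

n_in, n_out, batch = 8, 6, 4
# Sparse weights in coordinate (row, col, value) form -- an assumed layout.
nnz = 12
rows = rng.integers(0, n_out, nnz)
cols = rng.integers(0, n_in, nnz)
vals = rng.standard_normal(nnz)

def forward_single(x):
    """One pattern: every nonzero weight costs one gather from x."""
    y = np.zeros(n_out)
    for r, c, v in zip(rows, cols, vals):
        y[r] += v * x[c]          # scalar gather/scatter per pattern
    return y

def forward_batch(X):
    """Batch mode: one index pair serves a contiguous row of patterns."""
    Y = np.zeros((n_out, X.shape[1]))
    for r, c, v in zip(rows, cols, vals):
        Y[r, :] += v * X[c, :]    # vectorizable over the batch dimension
    return Y

X = rng.standard_normal((n_in, batch))
Y = forward_batch(X)
# Both formulations compute the same result; batch mode just reuses indices.
assert np.allclose(
    Y, np.column_stack([forward_single(X[:, j]) for j in range(batch)])
)
```

The inner update in `forward_batch` touches a contiguous slice per nonzero, which is the kind of regular access a vector unit handles far more cheaply than the per-pattern scatter/gather of the single-pattern loop.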