首页> 外文期刊>IEEE transactions on very large scale integration (VLSI) systems >TxSim: Modeling Training of Deep Neural Networks on Resistive Crossbar Systems
【24h】

TxSim: Modeling Training of Deep Neural Networks on Resistive Crossbar Systems

机译:TXSIM:电阻横杆系统深神经网络建模训练

获取原文
获取原文并翻译 | 示例

摘要

Deep neural networks (DNNs) have gained tremendous popularity in recent years due to their ability to achieve superhuman accuracy in a wide variety of machine learning tasks. However, the compute and memory requirements of DNNs have grown rapidly, creating a need for energy-efficient hardware. Resistive crossbars have attracted significant interest in the design of the next generation of DNN accelerators due to their ability to natively execute massively parallel vector-matrix multiplications within dense memory arrays. However, crossbar-based computations face a major challenge due to device and circuit-level nonidealities, which manifest as errors in the vector-matrix multiplications and eventually degrade DNN accuracy. To address this challenge, there is a need for tools that can model the functional impact of nonidealities on DNN training and inference. Existing efforts toward this goal are either limited to inference or are too slow to be used for large-scale DNN training. We propose TxSim, a fast and customizable modeling framework to functionally evaluate DNN training on crossbar-based hardware considering the impact of nonidealities. The key features of TxSim that differentiate it from prior efforts are: 1) it comprehensively models nonidealities during all training operations (forward propagation, backward propagation, and weight update) and 2) it achieves computational efficiency by mapping crossbar evaluations to well-optimized Basic Linear Algebra Subprograms (BLAS) routines and incorporates speedup techniques to further reduce simulation time with minimal impact on accuracy. TxSim achieves 6x-108x improvement in simulation speed over prior works, and thereby makes it feasible to evaluate the training of large-scale DNNs on crossbars. Our experiments using TxSim reveal that the accuracy degradation in DNN training due to nonidealities can be substantial (3%36.4%) for large-scale DNNs and data sets, underscoring the need for further research in mitigation techniques. We also analyze the impact of various device and circuit- level parameters and the associated nonidealities to provide key insights that can guide the design of crossbar-based DNN training accelerators.
机译:由于他们在各种机器学习任务中实现超人准确性,深度神经网络(DNNS)近年来越来越受欢迎。但是,DNN的计算和内存要求已迅速生长,以需要节能硬件。由于它们在密集的存储器阵列内本地执行大规模并行矢量 - 矩阵乘法,电阻式横梁对下一代DNN加速器的设计引起了重要兴趣。然而,基于横杆的计算面临着由于设备和电路级的非前沿而面临的重大挑战,其作为载体矩阵乘法中的误差显现出并且最终降低DNN精度。为解决这一挑战,需要工具,可以在DNN培训和推理中模拟非前沿的功能影响。对此目标的现有努力是限于推断,或者太慢用于大规模DNN培训。我们提出了TXSIM,一种快速和可定制的建模框架,以便在考虑非前熟的影响的基于横杆的硬件上的DNN培训。将其与事先努力区分开的TXSIM的关键特征是:1)在所有训练操作(前向传播,向后传播和权重更新)和2)中,它全面模型非前沿,它通过将横杆评估映射到优化的基本来实现计算效率线性代数子程序(BLA)例程并结合加速技术,以进一步降低模拟时间,对准确性的影响最小。 TXSIM在现有作品上实现了仿真速度的6倍108倍,从而可以评估在横梁上的大规模DNN训练。我们使用TXSIM的实验表明,对于非前熟,DNN培训的准确性降解可能是大型DNN和数据集的实质性(3%36.4%),强调需要进一步研究缓解技术。我们还分析了各种设备和电路级参数的影响以及相关的非侵入性,提供了可以指导基于横杆的DNN训练加速器设计的关键见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号