首页> 外文会议>Ninth Workshop on Virtual Intelligence/Dynamic Neural Networks >Neuroprocessor NeuroMatrix NM6403 architectural overview
【24h】

Neuroprocessor NeuroMatrix NM6403 architectural overview

机译:神经处理器NeuroMatrix NM6403体系结构概述

获取原文
获取原文并翻译 | 示例

摘要

Abstract: The paper represents an architecture overview of the NeuroMatrix NM6403 neuroprocessor designed for 32-bit and 64-bit data processing. The paper includes brief description of the neuroprocessor pinout, structure and functional units. The neuroprocessor comprise original RISC core, vector coprocessor (VCP) and some peripheral units. RISC core provides general control functions, 32-bit program and data address generation, 32-bit arithmetic, logic and shift operations. The main neuroprocessor operational unit is VCP, applied for variable bit-length vector data arithmetic, logic and saturation operations. The base VCP operation is matrix by vector multiplication with accumulation. Each data vector is a 64-bit word of packed data word. It is formed by set of variable bit length operands with user defined bit length in a range from 1 to 64 bits. Neuroprocessor includes two external 64-bit buses. The programmable memory interface units allow to use static or dynamic memory having wide range of time parameters without external controller. The neuroprocessor support shared memory mode for each of the external buses. With conjunction of two byte width communication ports this one makes it easy to design multiprocessor systems. Also this paper represents addressing modes, instruction set, supported interrupts. The neuroprocessor is designed using CMOS 0.5 $mu@m technology, power supply voltage is 3.3 V, clock rate is 50 MHz with one instruction per clock cycle performance.!2
机译:摘要:本文代表了针对32位和64位数据处理而设计的NeuroMatrix NM6403神经处理器的体系结构概述。本文包括对神经处理器引脚,结构和功能单元的简要说明。神经处理器包括原始的RISC内核,矢量协处理器(VCP)和一些外围单元。 RISC内核提供通用控制功能,32位程序和数据地址生成,32位算术,逻辑和移位运算。神经处理器的主要运算单元是VCP,用于可变位长向量数据的算术,逻辑和饱和运算。基本的VCP运算是通过矢量乘以累加的矩阵。每个数据向量都是打包数据字的64位字。它由可变位长操作数集组成,用户定义的位长范围为1到64位。 Neuroprocessor包括两个外部64位总线。可编程存储器接口单元允许使用具有广泛时间参数的静态或动态存储器,而无需外部控制器。神经处理器支持每个外部总线的共享内存模式。结合两个字节宽度的通信端口,该端口使设计多处理器系统变得容易。本文还介绍了寻址模式,指令集,支持的中断。神经处理器采用CMOS 0.5 $μm技术设计,电源电压为3.3 V,时钟速率为50 MHz,每个时钟周期性能一条指令。2

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号