首页> 外国专利> FP16-S7E8 MIXED-PRECISION FOR DEEP LEARNING AND OTHER ALGORITHMS

FP16-S7E8 MIXED-PRECISION FOR DEEP LEARNING AND OTHER ALGORITHMS

机译：FP16-S7E8用于深度学习和其他算法的混合精度

页面导航

摘要
著录项
相似文献

摘要

Disclosed embodiments relate to mixed-precision vector multiply-accumulate (MPVMAC) In one example, a processor includes fetch circuitry to fetch a compress instruction having fields to specify locations of a source vector having N single-precision formatted elements, and a compressed vector having N neural half-precision (NHP) formatted elements, decode circuitry to decode the fetched compress instruction, execution circuitry to respond to the decoded compress instruction by: converting each element of the source vector into the NHP format and writing each converted element to a corresponding compressed vector element, wherein the processor is further to fetch, decode, and execute a MPVMAC instruction to multiply corresponding NHP-formatted elements using a 16-bit multiplier, and accumulate each of the products with previous contents of a corresponding destination using a 32-bit accumulator.

机译：公开的实施例涉及混合精度矢量乘累加（MPVMAC）。在一个示例中，处理器包括获取电路以获取具有字段以指定具有N个单精度格式化元素的源矢量的位置的压缩指令，以及N个神经半精度（NHP）格式的元素，解码电路以对提取的压缩指令进行解码，执行电路以通过以下方式对解码的压缩指令作出响应：将源向量的每个元素转换为NHP格式，并将每个转换后的元素写入对应的压缩向量元素，其中处理器进一步提取，解码和执行MPVMAC指令，以使用16位乘法器将相应的NHP格式的元素相乘，并使用32位乘法器将每个乘积与相应目标的先前内容进行累加位累加器。

著录项

公开/公告号EP3620910A1

专利类型
公开/公告日2020-03-11

原文格式PDF
申请/专利权人 INTEL CORPORATION;
展开▼

申请/专利号EP20190183087
发明设计人 KASHYAP SIDHARTH N.;LEPPER ANGUS;BOYLE PETER;
展开▼

申请日2019-06-28
分类号G06F7/483;
国家 EP
入库时间 2022-08-21 11:40:06

相似文献

专利
外文文献
中文文献