首页> 外国专利> FP16-S7E8 mixed precision for deep learning and other algorithms

FP16-S7E8 mixed precision for deep learning and other algorithms

机译：FP16-S7E8深度学习和其他算法的混合精度

页面导航

摘要
著录项
相似文献

摘要

Disclosed embodiments relate to mixed-precision vector multiply-accumulate (MPVMAC) In one example, a processor includes fetch circuitry to fetch a compress instruction having fields to specify locations of a source vector having N single-precision formatted elements, and a compressed vector having N neural half-precision (NHP) formatted elements, decode circuitry to decode the fetched compress instruction, execution circuitry to respond to the decoded compress instruction by: converting each element of the source vector into the NHP format and writing each converted element to a corresponding compressed vector element, wherein the processor is further to fetch, decode, and execute a MPVMAC instruction to multiply corresponding NHP-formatted elements using a 16-bit multiplier, and accumulate each of the products with previous contents of a corresponding destination using a 32-bit accumulator.

机译：所公开的实施例涉及混合精度矢量乘法（MPVMAC）在一个示例中，处理器包括提取电路，以获取具有字段的压缩指令，以指定具有N单精度格式化元素的源矢量的位置，以及压缩向量的源矢量 N神经半精度（NHP）格式化元素，解码电路解码获取的压缩指令，执行电路响应解码的压缩指令：将源向量的每个元素转换为NHP格式并将每个转换元素写入对应的压缩矢量元素，其中处理器还用于使用16位乘法器将相应的NHP格式的元素乘以相应的NHP格式的元件，并使用32-使用相应目的地的先前内容累积每个产品位累加器。

著录项

公开/公告号US11093579B2

专利类型
公开/公告日2021-08-17

原文格式PDF
申请/专利权人 INTEL CORPORATION;
展开▼

申请/专利号US201816122030
发明设计人 SIDHARTH N. KASHYAP;ANGUS LEPPER;PETER BOYLE;
展开▼

申请日2018-09-05
分类号G06F17/16;G06F7/544;G06N3/08;G06F9/30;G06N5/04;G06N3/063;G06N3/04;
国家 US
入库时间 2022-08-24 20:37:31

相似文献

专利
外文文献
中文文献